1. Introduction

Let $k$ be a number field with set of valuations $\Omega_k$. Given an algebraic variety $X$ defined
over $k$ we have the obvious inclusions

\[ X(k) \xrightarrow{\;\Delta\;} X(\mathbb{A}_k) \subseteq \prod_{\nu \in \Omega_k} X(k_\nu), \]

where $\Delta$ is the diagonal embedding of the set $X(k)$ of $k$-rational points into the set $X(\mathbb{A}_k)$ of
adeles of $X$. Moreover the set $X(\mathbb{A}_k)$ is empty if and only if $\prod_{\nu \in \Omega_k} X(k_\nu)$ is empty, and so
provides a local obstruction to the existence of $k$-rational points on $X$. Recall that a class
of algebraic varieties $X$ defined over $k$ is said to satisfy the Hasse principle if $X(k) \neq \emptyset$
whenever $X(\mathbb{A}_k) \neq \emptyset$. Likewise the class is said to satisfy weak approximation if whenever it is
non-empty the image of $X(k)$ under $\Delta$ is dense in $X(\mathbb{A}_k)$ in the product of $\nu$-adic topologies.

This paper is concerned with the Hasse principle and weak approximation for the class of
varieties satisfying the Diophantine equation

\[ P(t) = \mathbf{N}_{K/k}(x_1, \dots, x_n) \neq 0, \tag{1.1} \]

where $\mathbf{N}_{K/k}$ is a full norm form for an extension $K/k$ of number fields, and $P(t)$ is a polynomial
over $k$. Thus if $[K : k] = n$ and we fix a basis $\{\omega_1, \dots, \omega_n\}$ for $K$ as a vector space over $k$,
then

\[ \mathbf{N}_{K/k}(x_1, \dots, x_n) := N_{K/k}(x_1\omega_1 + \cdots + x_n\omega_n). \]

Throughout this paper we will use $\mathbf{N}_{K/k}$ to denote a norm form, and $N_{K/k}$ to denote the
corresponding field norm.

Progress on this problem has been limited, and we begin by discussing what is known in the
simplest cases. A crude measure of difficulty is given by the number of distinct roots of $P(t)$
over an algebraic closure $\bar{k}$. When $P(t)$ is a non-zero constant polynomial the Hasse principle
for (1.1) is known as the "Hasse norm principle". The validity of the Hasse norm principle for
cyclic extensions $K/k$ was established by Hasse himself, but for non-cyclic extensions there
can be counterexamples. There is an extensive literature on the subject and it is known, for
example, that the Hasse norm principle holds if the field $K$ has prime degree over $k$ (Bartels
[1], for example); or the Galois group of $N/k$ is dihedral, where $N$ is the normal closure of $K$
over $k$ (Bartels [2]); or the extension $K/k$ is Galois and every Sylow subgroup of the Galois
group is cyclic (Gurak [16, Corollary 3.2]).

Following the work of Colliot-Thélène and Sansuc [6, Proposition 9.1], we also have simple
sufficient conditions to ensure that "weak approximation for norms" holds, by which we mean
that weak approximation holds for (1.1) when $P(t)$ is a non-zero constant polynomial. Let
$N$ be the normal closure of $K$ over $k$. Then weak approximation holds if either the degree
$[K : k]$ is prime, or if the Galois group of $N/k$ has cyclic Sylow subgroups. In particular the
latter result implies that it suffices for $K/k$ to be cyclic.

The next case to consider is that in which $P(t) = ct^d$ for some $c \in k^*$ and some positive
degree $d$. In this situation the Hasse principle and weak approximation may fail. However
(1.1) is a principal homogeneous space under an algebraic $k$-torus, and the work of Sansuc [23]
and Voskresenskii [24] shows that the Brauer-Manin obstruction is the only obstruction to
the Hasse principle and weak approximation on any smooth projective model of this variety.

When $[K : k] = 2$ and $P(t)$ has degree 3 or 4 then (1.1) defines a Châtelet surface. The
arithmetic of such surfaces is well-understood. The Hasse principle and weak approximation
may fail, but it has been shown by Colliot-Thélène, Sansuc and Swinnerton-Dyer [9] that
all such failures are explained by the Brauer-Manin obstruction. The same conclusion is
available when $[K : k] = 3$ and $\deg P(t) \leq 3$, by work of Colliot-Thélène and Salberger [4].

There have also been investigations into (1.1) when $P(t)$ factors completely over $k$, with
at most two roots. In this case one may write

\[ P(t) = c(t - a)^u (t - b)^v \]

with $a, b, c \in k$ and $u, v \in \mathbb{N}$. It is known that one has the Hasse principle and weak
approximation whenever the Brauer-Manin obstruction is empty, providing that we work over the
ground field $k = \mathbb{Q}$. This was first proved under the assumption that $\gcd(u, v, n) = 1$, by
Heath-Brown and Skorobogatov [18], a condition that was subsequently removed by
Colliot-Thélène, Harari and Skorobogatov [8]. While all the previous work described had been purely
algebraic, the approach used by Heath-Brown and Skorobogatov combined analytic machinery,
in the form of the Hardy-Littlewood circle method, with the previous descent approach
to the Brauer-Manin obstruction. The circle method can be adapted, with some effort, to
apply to ground fields other than $k = \mathbb{Q}$. However, for simplicity, this possibility was not
explored in [18].

Very little is known about other polynomials $P(t)$. When $P(t)$ is a non-zero separable
polynomial with degree at least 2, it is conjectured that the Hasse principle and weak
approximation hold whenever the Brauer-Manin obstruction is empty. When $K/k$ is cyclic
and Schinzel's Hypothesis is granted, work of Colliot-Thélène, Skorobogatov and
Swinnerton-Dyer [11, Theorem 1.1] yields a positive answer to this question. Note that this result is
already a special case of earlier work of Colliot-Thélène and Swinnerton-Dyer [10] on pencils
of Severi-Brauer varieties, but this connection is only made clear in the discussion [11, page
10]. Note that when $K/k$ is cyclic the Brauer-Manin obstruction is known to be empty if
$P(t)$ is irreducible over $k$. (This follows from Corollary 2.6(c) of Colliot-Thélène, Harari and
Skorobogatov [8], which shows that the Brauer group contains only vertical elements when
$K/k$ is cyclic. However it is not hard to show that the vertical part of the Brauer group is
trivial when $P(t)$ is irreducible over $k$, using the remark on page 76 of [8].)

Finally we mention that there is potential for tackling the case in which $P(t)$ is an arbitrary
polynomial which splits completely over $k$ into $d$ linear factors, at least in the case $k = \mathbb{Q}$,
by using ideas from the work of Green and Tao [14] together with the main theorem from
Green, Tao and Ziegler [15]. In this case the methods of Heath-Brown and Skorobogatov [18]
reduce the problem (1.1) to one involving a system of equations

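\[ \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x}_i) + a_i \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x}_0) = c_i \mathbf{N}_{K/\mathbb{Q}}(\mathbf{y}_i) \neq 0. \]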

In the language of [14], this is a system of linear forms of finite complexity. The machinery 
of Green and Tao, and of Green, Tao and Ziegler, allows one to handle such systems when 
the norms are replaced by primes, and it seems reasonable to hope that a variant of the 
method would allow one to handle the problem above. This plan was first mentioned to us 
by Professor Wooley. 

It will be apparent from the foregoing survey that the most obvious open case is that in
which $P(t)$ is an irreducible quadratic over $k$, and this is the goal of the present paper. Again
we shall be dependent on techniques from analytic number theory which have not been fully
developed for ground fields other than $k = \mathbb{Q}$, so we shall confine attention to this latter case.
With this restriction our goal will be to establish the Hasse principle and weak approximation
for

\[ P(t) = \mathbf{N}_{K/\mathbb{Q}}(x_1, \dots, x_n) \neq 0, \tag{1.2} \]

under suitable assumptions on the extension $K/\mathbb{Q}$. Let $|\cdot|_\nu$ denote the $\nu$-adic norm, which
we extend to vectors by setting $|\mathbf{z}|_\nu := \max_{1 \leq i \leq m} |z_i|_\nu$ if $\mathbf{z} = (z_1, \dots, z_m)$. When $\nu = \infty$ we
will simply write $|\cdot|_\infty = |\cdot|$. With this in mind the following is our main result.

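Theorem 1. - Let $P(t) \in \mathbb{Q}[t]$ be an irreducible quadratic polynomial and let $K$ be a quartic
extension of $\mathbb{Q}$ containing a root of $P(t)$. Suppose that, for every $\nu \in \Omega_{\mathbb{Q}}$, we are given a
solution $(t^{(\nu)}, \mathbf{x}^{(\nu)}) \in \mathbb{Q}_\nu^5$ of (1.2). Let $S \subset \Omega_{\mathbb{Q}}$ be any finite subset and let $\varepsilon > 0$. Then there
is a solution $(t, \mathbf{x}) \in \mathbb{Q}^5$ of (1.2) such that

\[ |t - t^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{x} - \mathbf{x}^{(\nu)}|_\nu < \varepsilon, \tag{1.3} \]

for every $\nu \in S$. Thus the Hasse principle and weak approximation hold for (1.2).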

It is interesting to note that our result is both unconditional and concerns field extensions
$K/\mathbb{Q}$ which may be non-cyclic. This marks a departure from the sort of results achieved in
[11]. In fact our theorem answers in the affirmative a question posed by Colliot-Thélène,
during the 2005 Bremen workshop "Rational points on curves - explicit methods", about
the Hasse principle for (1.2) in the special case that $K$ is a biquadratic extension containing
a root of $P(t)$ (cf. the questions at the close of §2 in the work of Colliot-Thélène, Harari and
Skorobogatov [8]).

The proof of Theorem 1 relies on techniques from analytic number theory and is inspired
by work of Fouvry and Iwaniec [12], who proved that there are infinitely many primes $p$ of the
form $a^2 + q^2$, with $q$ also prime. More generally they showed how to produce primes of the form
$a^2 + q^2$ with $q$ from any sufficiently dense set. Our argument involves many complexities of
detail, but also one major simplification, since we have only to produce integers in $N_{K/\mathbb{Q}}(K^*)$,
rather than primes.

We can generalise our result mildly, to include the case in which $P(t) = cQ(t)^u$ for an odd
positive integer $u$, where $Q(t) \in \mathbb{Q}[t]$ is an irreducible quadratic polynomial. This is achieved
by establishing a bijection between solutions of (1.2) and solutions of the corresponding
equation in which $P(t)$ is quadratic. To do this we begin by choosing $e, f \in \mathbb{Z}$ for which
$eu + 4f = 1$. The equation (1.2) becomes $cQ(t)^u = \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x})$, and raising to the power $e$
we obtain $c^e Q(t)^{1-4f} = \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x})^e$, whence $c^e Q(t) = \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x})^e Q(t)^{4f}$. Since $K$ has degree 4
over $\mathbb{Q}$, we deduce that $P_0(t) := c^e Q(t)$ is a norm from $K$ whenever $P(t) = cQ(t)^u$ is a norm
from $K$. The converse deduction is similar. Thus if $c^e Q(t) = \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x})$ then, raising both
sides to the power $u$ we find that $c^{1-4f} Q(t)^u = \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x})^u$, whence $cQ(t)^u = c^{4f}\mathbf{N}_{K/\mathbb{Q}}(\mathbf{x})^u$.
Thus $P(t)$ is a norm from $K$ whenever $P_0(t)$ is a norm from $K$. It is now immediate from
Theorem 1 that we have the Hasse principle and weak approximation for $P(t) = cQ(t)^u$.
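
The exponent bookkeeping in this reduction can be checked numerically. The following is a minimal sketch only: the helper names `bezout_exponents` and `check_reduction` are hypothetical, and rational numbers stand in for the values of $c$, $Q(t)$ and the norm. It uses the fact that $u^2 \equiv 1 \pmod 4$ for odd $u$, so that $e = u \bmod 4$ is one admissible choice.

```python
from fractions import Fraction

def bezout_exponents(u):
    """Return integers (e, f) with e*u + 4*f == 1, for an odd integer u."""
    if u % 2 == 0:
        raise ValueError("u must be odd so that gcd(u, 4) = 1")
    e = u % 4                 # u*u is 1 modulo 4 when u is odd
    f = (1 - e * u) // 4      # exact division, since e*u is 1 modulo 4
    return e, f

def check_reduction(c, q, u):
    """Check both directions of the reduction for non-zero rationals c, q.

    Here q plays the role of the value Q(t), and n = c*q**u the role of
    the norm value in (1.2)."""
    e, f = bezout_exponents(u)
    n = c * q**u
    forward = (c**e * q == n**e * (q**f)**4)      # c^e Q = N^e * (Q^f)^4
    m = c**e * q
    backward = (c * q**u == (c**f)**4 * m**u)     # c Q^u = (c^f)^4 * (c^e Q)^u
    return forward and backward
```

In both directions the extra factor is a fourth power of a rational number, which is a norm from $K$ because $K$ has degree 4 over $\mathbb{Q}$.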

By a simple change of variable we may assume that $P(t) = c(1 - at^2)$ in Theorem 1, where
$c$ is a non-zero rational and $a$ is a square-free integer. Let

\[ L := \mathbb{Q}(\sqrt{a}). \]

The fields to which our theorem applies take the shape $K = L(\sqrt{\beta})$, with $\beta \in L$. In particular
we have $L \subseteq K$ in the statement of Theorem 1. It turns out that most of our argument
carries over to an arbitrary degree $n$ extension $K$ of $\mathbb{Q}$ that contains $L$ as a subfield. Given
$P(t) = c(1 - at^2)$ as above, we suppose that $(t^{(\nu)}, \mathbf{x}^{(\nu)}) \in \mathbb{Q}_\nu^{n+1}$ are solutions of (1.2), for each
$\nu \in \Omega_{\mathbb{Q}}$. Then we want to determine conditions on $K$, beyond the hypothesis $L \subset K$, such
that for any finite set $S \subset \Omega_{\mathbb{Q}}$ we can find a solution $(t, \mathbf{x}) \in \mathbb{Q}^{n+1}$ of (1.2) for which the weak
approximation condition (1.3) holds for each $\nu \in S$.

In pursuing this goal we may assume that $\{\omega_1, \dots, \omega_n\}$ is an integral basis for the ring of
integers $\mathfrak{o}_K$, with $\omega_1 = 1$. By the transitivity of norms we have $N_{K/\mathbb{Q}} = N_{L/\mathbb{Q}} \circ N_{K/L}$, since
$L \subset K$. Hence any norm from $K$ to $\mathbb{Q}$ is also a norm from $L$ to $\mathbb{Q}$. We will make frequent
use of this fact in our work.
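
The transitivity of norms can be illustrated in a toy tower. The sketch below is an illustration under stated assumptions, not part of the argument: the choices $a = 2$ and $\beta = 3 + \sqrt{2}$ are hypothetical, elements of $L$ are stored as pairs $(x_1, x_2)$ meaning $x_1 + x_2\sqrt{a}$, and elements of $K = L(\sqrt{\beta})$ as pairs of such pairs.

```python
from fractions import Fraction

A = 2                                  # hypothetical choice: L = Q(sqrt(2))
BETA = (Fraction(3), Fraction(1))      # hypothetical beta = 3 + sqrt(2); K = L(sqrt(beta))

def l_add(x, y):
    return (x[0] + y[0], x[1] + y[1])

def l_sub(x, y):
    return (x[0] - y[0], x[1] - y[1])

def l_mul(x, y):
    """Multiply x1 + x2*sqrt(A) by y1 + y2*sqrt(A)."""
    return (x[0] * y[0] + A * x[1] * y[1], x[0] * y[1] + x[1] * y[0])

def norm_L_Q(x):
    """Field norm from L to Q: x1^2 - A*x2^2."""
    return x[0] * x[0] - A * x[1] * x[1]

def k_mul(p, q):
    """Multiply p0 + p1*sqrt(beta) by q0 + q1*sqrt(beta), with p0, p1, q0, q1 in L."""
    return (l_add(l_mul(p[0], q[0]), l_mul(BETA, l_mul(p[1], q[1]))),
            l_add(l_mul(p[0], q[1]), l_mul(p[1], q[0])))

def norm_K_L(p):
    """Relative norm from K to L: p0^2 - beta*p1^2."""
    return l_sub(l_mul(p[0], p[0]), l_mul(BETA, l_mul(p[1], p[1])))

def norm_K_Q(p):
    """Absolute norm, computed as the composite of the two relative norms."""
    return norm_L_Q(norm_K_L(p))
```

By construction every value of `norm_K_Q` is a value of `norm_L_Q`, which is the fact used repeatedly in the text; the multiplicativity of each norm gives a basic consistency check on the arithmetic.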

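Since $1 - at^2 = N_{L/\mathbb{Q}}(1 + t\sqrt{a})$ it follows from the hypotheses of the theorem that the
equation $c = N_{L/\mathbb{Q}}(u + v\sqrt{a})$ can be solved for $u, v \in \mathbb{Q}_\nu$, for any $\nu \in \Omega_{\mathbb{Q}}$. The Hasse norm
principle therefore implies that there exists $\delta \in L$ such that

\[ c = N_{L/\mathbb{Q}}(\delta)^{-1}. \]

Thus it will suffice to work with the equation

\[ 1 - at^2 = N_{L/\mathbb{Q}}(\delta)\mathbf{N}_{K/\mathbb{Q}}(\mathbf{x}) \neq 0, \tag{1.4} \]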
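rather than (1.2), with square-free $a \in \mathbb{Z}$ and non-zero $\delta \in L$. We are then given a finite set
$S \subset \Omega_{\mathbb{Q}}$ and a solution $(t^{(\nu)}, \mathbf{x}^{(\nu)}) \in \mathbb{Q}_\nu^{n+1}$ of this equation for every $\nu \in \Omega_{\mathbb{Q}}$ and we wish to
establish the existence of a solution $(t, \mathbf{x}) \in \mathbb{Q}^{n+1}$ such that (1.3) holds for every $\nu \in S$. Our
plan is to achieve this by arranging that

\[ \mathbf{N}_{K/\mathbb{Q}}(\mathbf{w})(1 + t\sqrt{a}) = \delta\mathbf{N}_{K/L}(\mathbf{y}) \neq 0, \tag{1.5} \]

where

\[ \mathbf{N}_{K/L}(y_1, \dots, y_n) := N_{K/L}(y_1\omega_1 + \cdots + y_n\omega_n). \]

Then if $\beta_1 = y_1\omega_1 + \cdots + y_n\omega_n$ and $\beta_2 = w_1\omega_1 + \cdots + w_n\omega_n$ we obtain (1.4) on taking
$x_1\omega_1 + \cdots + x_n\omega_n$ to correspond to the element $\beta = \beta_1\beta_2^{-2}$. Indeed, applying $N_{L/\mathbb{Q}}$ to (1.5)
gives $\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w})^2(1 - at^2) = N_{L/\mathbb{Q}}(\delta)N_{K/\mathbb{Q}}(\beta_1)$. It will be convenient to write
$\mathbf{x} = \mathbf{y}.\mathbf{w}^{-2}$ for the vector $\mathbf{x}$ produced by this construction.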

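In fact we will establish the following result, which demonstrates the Hasse principle and
weak approximation for (1.5), for any number field $K$ of degree $n$ that contains $L$.

Theorem 2. - Let $\delta \in L$ be non-zero and assume that $L \subset K$. Suppose that, for every $\nu \in \Omega_{\mathbb{Q}}$, we
are given a solution $(t^{(\nu)}, \mathbf{w}^{(\nu)}, \mathbf{y}^{(\nu)}) \in \mathbb{Q}_\nu^{2n+1}$ of (1.5). Let $S \subset \Omega_{\mathbb{Q}}$ be any finite subset and
let $\varepsilon > 0$. Then there is a solution $(t, \mathbf{w}, \mathbf{y}) \in \mathbb{Q}^{2n+1}$ of (1.5) such that

\[ |t - t^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{w} - \mathbf{w}^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{y} - \mathbf{y}^{(\nu)}|_\nu < \varepsilon, \]

for every $\nu \in S$.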

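In § 2 we will show how Theorem 1 follows from this result when $K$ is a quadratic extension
of $L$. This will be achieved via the following result.

Lemma 1. - Let $K$ be a quadratic extension of $L$. Let $S \subset \Omega_{\mathbb{Q}}$ be a finite set and let $\varepsilon > 0$
be given. Suppose that the equation

\[ c(1 - at^2) = \mathbf{N}_{K/\mathbb{Q}}(\mathbf{x}) \neq 0 \tag{1.6} \]

has solutions $(t^{(\nu)}, \mathbf{x}^{(\nu)})$ everywhere locally. Then there exists $\delta \in L$ with $c = N_{L/\mathbb{Q}}(\delta)^{-1}$
such that

\[ 1 + t\sqrt{a} = \delta\mathbf{N}_{K/L}(\mathbf{x}) \neq 0 \tag{1.7} \]

has solutions $(t_0^{(\nu)}, \mathbf{x}_0^{(\nu)})$ everywhere locally, with

\[ |t_0^{(\nu)} - t^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{x}_0^{(\nu)} - \mathbf{x}^{(\nu)}|_\nu < \varepsilon, \]

for every $\nu \in S$.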

We should emphasise here that when we speak of local solutions we are thinking of zeros
over $\mathbb{Q}_\nu$ of the polynomial, defined over $\mathbb{Q}_\nu$, which specifies the equation. In particular,
elements of the completions $L_\mu$, for $\mu \mid \nu$, do not occur.

It has been suggested to us by Professor Colliot-Thélène that the open descent method
of Colliot-Thélène and Skorobogatov [7] might be used to establish a variant of Lemma 1
in which $K$ is an arbitrary finite extension of $L$. The proposed lemma would then give the
same conclusion as Lemma 1, but under the assumption that the solutions $(t^{(\nu)}, \mathbf{x}^{(\nu)})$ of (1.6)
produce an adelic point orthogonal to the Brauer group of the variety. Once combined with
Theorem 2, this should demonstrate that the Brauer-Manin obstruction to the Hasse principle
and to weak approximation is the only one for (1.2), when $P(t) \in \mathbb{Q}[t]$ is an irreducible
quadratic polynomial and $K$ is an arbitrary extension of $\mathbb{Q}$ containing a root of $P(t)$. However
this would still leave open the difficult problem of calculating the Brauer group, which our
route avoids.

By Lemma 1, given local solutions of (1.6), we may produce an equation (1.5) in which
$\mathbf{w} = (1, 0, 0, \dots, 0)$ and $\mathbf{y} = \mathbf{x}$, and which has corresponding local solutions suitably close to
those of (1.6). We may then use Theorem 2 to produce a global solution of (1.5) close to
the given local solutions. Finally, taking the norm from $L$ to $\mathbb{Q}$ we obtain a suitable global
solution of (1.6). It should be pointed out that this argument uses the fact that the map
from $(\mathbf{w}, \mathbf{y})$ to $\mathbf{x} = \mathbf{y}.\mathbf{w}^{-2}$ is continuous for $|\cdot|_\nu$, providing that we avoid a neighbourhood of
$\mathbf{w} = \mathbf{0}$. We stress that the only point in the paper where we use our assumption that $K/L$ is
quadratic occurs in the proof of Lemma 1.

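We proceed to indicate the initial steps in our treatment of (1.5) in Theorem 2. A suitable
value of $t \in \mathbb{Q}$ will exist, providing that

\[ \mathrm{Tr}_{L/\mathbb{Q}}(\delta\mathbf{N}_{K/L}(\mathbf{y})) = 2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w}) \neq 0. \tag{1.8} \]

Moreover, we will then have

\[ t = \frac{1}{2}\mathrm{Tr}_{L/\mathbb{Q}}\left( \frac{\delta\mathbf{N}_{K/L}(\mathbf{y})}{\sqrt{a}\,\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w})} \right). \]

Thus if we have a solution of (1.8) in which $\mathbf{y}$ and $\mathbf{w}$ are sufficiently close to $\mathbf{y}^{(\nu)}$ and $\mathbf{w}^{(\nu)}$
it will be automatic that the corresponding solution $t$ will be close to $t^{(\nu)}$. It follows that we
have only to establish a suitable Hasse principle and weak approximation result for (1.8).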
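
The passage from the trace condition (1.8) back to a value of $t$ can be sketched as follows. This is an illustration only: the element $z$ stands for $\delta\mathbf{N}_{K/L}(\mathbf{y})$, written as a pair $(z_1, z_2)$ meaning $z_1 + z_2\sqrt{a}$, the rational `n_w` stands for $\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w})$, and the function names are hypothetical.

```python
from fractions import Fraction

def trace_L_Q(x):
    """Trace from L = Q(sqrt(a)) to Q of x1 + x2*sqrt(a)."""
    return 2 * x[0]

def div_by_sqrt_a(x, a):
    """(x1 + x2*sqrt(a)) / sqrt(a) = x2 + (x1/a)*sqrt(a)."""
    return (x[1], Fraction(x[0]) / a)

def recover_t(z, n_w, a):
    """Given Tr(z) = 2*n_w with n_w non-zero, return t with n_w*(1 + t*sqrt(a)) = z."""
    if n_w == 0 or trace_L_Q(z) != 2 * n_w:
        raise ValueError("need Tr(z) = 2*n_w with n_w non-zero")
    # t = (1/2) * Tr( z / (sqrt(a) * n_w) )
    return Fraction(trace_L_Q(div_by_sqrt_a(z, a))) / (2 * n_w)
```

For instance, with $a = 3$, `n_w = 5` and $z = 5 + \tfrac{10}{7}\sqrt{3}$ the function returns $t = 2/7$, and indeed $5(1 + \tfrac{2}{7}\sqrt{3}) = z$.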

We must make a further manoeuvre before reaching our fundamental equation. As
mentioned above, the work of Fouvry and Iwaniec handles primes. Using standard machinery,
prime numbers are dealt with by means of "Type I sums" and "Type II sums". Of these, Type
II sums involve bilinear forms in which the prime number is replaced by a product of integers
$uv$, which have to lie in suitable ranges. In our case we can insist that our norm $\mathbf{N}_{K/L}(\mathbf{y})$
is a product $\mathbf{N}_{K/L}(\mathbf{u})\mathbf{N}_{K/L}(\mathbf{v})$, thereby eliminating the need to consider Type I sums.
Indeed, since we can specify the sizes of $\mathbf{u}$ and $\mathbf{v}$, the treatment of the Type II sums will also
be simplified somewhat. Thus instead of attacking (1.8) we shall consider the Diophantine
equation

\[ \mathrm{Tr}_{L/\mathbb{Q}}(\delta\mathbf{N}_{K/L}(\mathbf{u})\mathbf{N}_{K/L}(\mathbf{v})) = 2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w}) \neq 0, \]

with the aim of finding suitably localised solutions

\[ (\mathbf{u}, \mathbf{v}, \mathbf{w}) \in \mathbb{Z}^n \times \mathbb{Z}^n \times \mathbb{Z}^n. \]

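Let $\sigma$ denote the non-trivial automorphism of $L$ and suppose that $\{1, \tau\}$ is a $\mathbb{Z}$-basis for
$\mathfrak{o}_L$, and hence also a $\mathbb{Q}$-basis for $L$. For technical reasons it will be convenient to replace the
trace $\mathrm{Tr}_{L/\mathbb{Q}}$ by a "skew-trace"

\[ \mathrm{Tr}(x, y) := \mathrm{Tr}_{L/\mathbb{Q}}(x y^{\sigma} D_L^{-1}) \]

for $x, y \in L$, where $D_L = \tau - \tau^{\sigma}$. Thus $(D_L)$ is the different of $L/\mathbb{Q}$. On writing

\[ x = \delta\mathbf{N}_{K/L}(\mathbf{u}), \quad y = (\mathbf{N}_{K/L}(\mathbf{v})D_L)^{\sigma}, \]

our condition becomes

\[ \mathrm{Tr}(x, y) = 2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w}) \neq 0. \tag{1.9} \]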
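
To see why the name "skew-trace" is apt, one can compute it in coordinates. The sketch below assumes, purely for illustration, that $\tau = \sqrt{a}$ (so $a \equiv 2$ or $3 \pmod 4$ and $D_L = 2\sqrt{a}$); under that assumption a short calculation gives $\mathrm{Tr}(x, y) = x_2y_1 - x_1y_2$ for $x = x_1 + x_2\tau$ and $y = y_1 + y_2\tau$. The function names are hypothetical.

```python
from fractions import Fraction

def mul(x, y, a):
    """Multiply x1 + x2*sqrt(a) by y1 + y2*sqrt(a)."""
    return (x[0] * y[0] + a * x[1] * y[1], x[0] * y[1] + x[1] * y[0])

def conj(x):
    """Apply the non-trivial automorphism sigma: sqrt(a) -> -sqrt(a)."""
    return (x[0], -x[1])

def skew_trace(x, y, a):
    """Tr_{L/Q}(x * sigma(y) / D_L), assuming tau = sqrt(a) and D_L = 2*sqrt(a)."""
    p = mul(x, conj(y), a)
    # (p1 + p2*sqrt(a)) / (2*sqrt(a)) = p2/2 + (p1/(2a))*sqrt(a)
    q = (Fraction(p[1], 2), Fraction(p[0], 2 * a))
    return 2 * q[0]           # the trace is twice the rational part
```

The form is alternating, so `skew_trace(x, x, a)` vanishes, and substituting $y = (zD_L)^{\sigma}$ recovers the ordinary trace of $xz$, which is how (1.9) arises from the preceding equation.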

We will count suitably restricted solutions of this equation. If $\mathcal{N}$ is the number of such
solutions we can write

\[ \mathcal{N} = \sum_{x} \sum_{y} \alpha(x)\beta(y)\lambda(\mathrm{Tr}(x, y)). \]

Here the function $\alpha(x)$, respectively $\beta(y)$, counts appropriately restricted representations of $x$
by $\delta\mathbf{N}_{K/L}(\mathbf{u})$, respectively of $y$ by $(\mathbf{N}_{K/L}(\mathbf{v})D_L)^{\sigma}$. Moreover $\lambda(l)$ counts suitably constrained
solutions of $l = 2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w})$.

Our expression for $\mathcal{N}$ can be viewed as a bilinear form. We have good techniques for
estimating these, going back to the works of Vinogradov. However the methods are designed
to produce upper bounds for bilinear forms in which we expect cancellation, while our problem
is to establish an asymptotic formula for an expression in which all the terms are non-negative.
We shall therefore split $\alpha(x)$ into two parts $\alpha(x) = \hat{\alpha}(x) + \alpha_0(x)$, and write

\[ \mathcal{N} = \mathcal{M} + \mathcal{E}, \]

where $\mathcal{M}$ is a main term, and contains the contribution from $\hat{\alpha}(x)$, while $\mathcal{E}$ is an error
term, and is the corresponding expression involving $\alpha_0(x)$. Thus we will need $\hat{\alpha}(x)$ to be a
sufficiently simple function that we can compute $\mathcal{M}$ directly. Moreover we will want $\alpha_0(x)$
to produce sufficient cancellation on average, so that a bilinear form estimation of $\mathcal{E}$ can be
achieved. The underlying principle here is exactly that which Linnik [22] developed in his
"dispersion method".

In § 4 we will describe a general procedure for producing an approximation of the type
$\hat{\alpha}(x)$. In our context $x$ runs over the ring $\mathfrak{o}_L$, but we will begin by presenting the method
as it applies to sequences indexed by $\mathbb{Z}$, since we hope this will prove to be of independent
interest. With this choice made, our proof that the bilinear form $\mathcal{E}$ makes a satisfactory
overall contribution to the asymptotic formula is the subject of § 7. It is this part of our
argument which is based on ideas from the work of Fouvry and Iwaniec [12], who provide a
general framework for estimating sums of this sort. Finally, the asymptotic evaluation of $\mathcal{M}$
will be executed in § 8 and § 9.
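
The splitting of the count into a main term and an error term rests only on the linearity of the bilinear form in its first weight. The toy sketch below illustrates this with finitely supported weights stored as dictionaries. It is not the authors' construction: the weights are arbitrary, the names are hypothetical, and the skew-trace is taken in coordinates as $x_2y_1 - x_1y_2$, which is an assumption of the sketch.

```python
from fractions import Fraction

def skew(x, y):
    """Coordinate form of the skew-trace assumed in this sketch."""
    return x[1] * y[0] - x[0] * y[1]

def bilinear(alpha, beta, lam):
    """Sum of alpha(x) * beta(y) * lam(skew(x, y)) over the supports of alpha, beta."""
    total = 0
    for x, ax in alpha.items():
        for y, by in beta.items():
            total += ax * by * lam.get(skew(x, y), 0)
    return total

def split(alpha, alpha_hat):
    """Return alpha_0 = alpha - alpha_hat on the union of the two supports."""
    keys = set(alpha) | set(alpha_hat)
    return {k: alpha.get(k, 0) - alpha_hat.get(k, 0) for k in keys}
```

With `alpha_0 = split(alpha, alpha_hat)` one has `bilinear(alpha, ...) == bilinear(alpha_hat, ...) + bilinear(alpha_0, ...)` exactly. The substance of the method lies in choosing the approximation so that the first sum can be evaluated and the second exhibits cancellation.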

Acknowledgements. - Some of this work was done while the authors were visiting the 
Hausdorff Institute in Bonn and the Institute for Advanced Study in Princeton, and while the 
second author was visiting the Mathematical Sciences Research Institute in Berkeley. The 
hospitality and financial support of these bodies is gratefully acknowledged. While working 
on this paper the first author was supported by EPSRC grant number EP/E053262/1. 

2. Deduction of Theorem 1 

The goal of this section is to prove Lemma 1. As explained in the previous section, this is
enough to allow the deduction of Theorem 1 from Theorem 2. Let $P(t) = c(1 - at^2)$, where
$c$ is a non-zero rational and $a$ is a square-free integer. For the moment let $K$ be an arbitrary
number field of degree $n$ containing $L = \mathbb{Q}(\sqrt{a})$. In particular $n$ is even. Fix any $\delta_0 \in L$
such that $c = N_{L/\mathbb{Q}}(\delta_0)^{-1}$ and let $S \subset \Omega_{\mathbb{Q}}$ be finite. We may assume that $S$ contains the
archimedean valuation, together with any valuations that become ramified in $K$, and any
valuations $\nu \in \Omega_{\mathbb{Q}}$ for which $v_\mu(\delta_0) \neq 0$ for some $\mu \in \Omega_L$ above $\nu$.

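We proceed to look for a suitable $\delta$, fulfilling the conditions of Lemma 1, by examining
values $\delta = \delta_0\gamma^{\sigma}\gamma^{-1}$. We will see that when $\nu \notin S$ any non-zero $\gamma \in L$ is acceptable. Thus our first
task is to find a value $\gamma$ which works for the "bad" places $\nu \in S$. For such valuations we claim
that there exist $\gamma_1^{(\nu)}, \gamma_2^{(\nu)} \in \mathbb{Q}_\nu$, with

\[ \gamma^{(\nu)} = \gamma_1^{(\nu)} + \gamma_2^{(\nu)}\sqrt{a} \neq 0, \]

such that

\[ 1 + t^{(\nu)}\sqrt{a} = \delta_0 (\gamma^{(\nu)})^{\sigma} (\gamma^{(\nu)})^{-1} \mathbf{N}_{K/L}(\mathbf{x}^{(\nu)}). \]

It then suffices to choose $\gamma \in L$ so that $\gamma$ is close to $\gamma^{(\nu)}$ for each $\nu \in S$, since then we may
take $t_0^{(\nu)} = t^{(\nu)}$ and find a suitable $\mathbf{x}_0^{(\nu)}$ close to $\mathbf{x}^{(\nu)}$.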

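To establish the claim we begin by noting that the form $\mathbf{N}_{K/L}$ decomposes as

\[ \mathbf{N}_{K/L} = N_1 + N_2\sqrt{a}, \tag{2.1} \]

over $L$, where $N_1, N_2 \in \mathbb{Q}[x_1, \dots, x_n]$ are forms of degree $n/2$. Setting $\delta_0 = \delta_1 + \delta_2\sqrt{a}$, and
$\gamma^{(\nu)} = c_1 + c_2\sqrt{a}$, and multiplying through by $\gamma^{(\nu)}$, we see that the equation becomes

\[ (c_1 + c_2\sqrt{a})(1 + t^{(\nu)}\sqrt{a}) = (c_1 - c_2\sqrt{a})(\delta_1 + \delta_2\sqrt{a})(N_1(\mathbf{x}^{(\nu)}) + N_2(\mathbf{x}^{(\nu)})\sqrt{a}). \tag{2.2} \]

Thus our problem is to show the existence of $(c_1, c_2) \in \mathbb{Q}_\nu^2$ satisfying this, given the condition
(1.4), namely

\[ 1 - a(t^{(\nu)})^2 = (\delta_1^2 - a\delta_2^2)(N_1(\mathbf{x}^{(\nu)})^2 - aN_2(\mathbf{x}^{(\nu)})^2) \neq 0. \tag{2.3} \]

If we set $A_1 = \delta_1N_1(\mathbf{x}^{(\nu)}) + a\delta_2N_2(\mathbf{x}^{(\nu)})$ and $A_2 = \delta_1N_2(\mathbf{x}^{(\nu)}) + \delta_2N_1(\mathbf{x}^{(\nu)})$ for convenience,
then (2.2) becomes a pair of conditions

\[ c_1(1 - A_1) + c_2(at^{(\nu)} + aA_2) = c_1(t^{(\nu)} - A_2) + c_2(A_1 + 1) = 0. \tag{2.4} \]

We need to find a solution $c_1, c_2$ of these, with $c_1^2 - ac_2^2 \neq 0$. In doing so we may assume (2.3),
which becomes

\[ 1 - a(t^{(\nu)})^2 = A_1^2 - aA_2^2 \neq 0. \]

However the determinant of the system (2.4) is

\[ (1 - A_1)(A_1 + 1) - (at^{(\nu)} + aA_2)(t^{(\nu)} - A_2) = (1 - a(t^{(\nu)})^2) - (A_1^2 - aA_2^2) = 0, \]

so that (2.4) has a non-trivial solution. Moreover, if $c_1 = \pm\sqrt{a}c_2 \neq 0$ one readily deduces
from (2.2) that $1 - a(t^{(\nu)})^2 = 0$, which is impossible. This suffices for the proof of the claim.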
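
The determinant computation can be checked with exact rational arithmetic. In the sketch below, which is an illustration rather than part of the proof, we work over $\mathbb{Q}$ instead of $\mathbb{Q}_\nu$ and manufacture an admissible pair $(A_1, A_2)$ by starting from a chosen $\gamma$ and $t$ and setting $A_1 + A_2\sqrt{a} = (\gamma/\gamma^{\sigma})(1 + t\sqrt{a})$. The helper names and the numerical choices are hypothetical.

```python
from fractions import Fraction

def mul(x, y, a):
    """Multiply x1 + x2*sqrt(a) by y1 + y2*sqrt(a)."""
    return (x[0] * y[0] + a * x[1] * y[1], x[0] * y[1] + x[1] * y[0])

def conj(x):
    return (x[0], -x[1])

def inv(x, a):
    """Inverse of a non-zero x1 + x2*sqrt(a), via the norm."""
    n = Fraction(x[0] * x[0] - a * x[1] * x[1])
    return (x[0] / n, -x[1] / n)

def system_matrix(A1, A2, t, a):
    """Coefficient matrix of the pair of linear conditions on (c1, c2)."""
    return ((1 - A1, a * t + a * A2), (t - A2, A1 + 1))

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def build_A(gamma, t, a):
    """Return (A1, A2) with A1 + A2*sqrt(a) = (gamma / sigma(gamma)) * (1 + t*sqrt(a))."""
    ratio = mul(gamma, inv(conj(gamma), a), a)
    return mul(ratio, (Fraction(1), t), a)
```

For such a pair the determinant vanishes and the coordinates of $\gamma$ solve both linear conditions, in line with the claim. For arbitrary $A_1, A_2, t$ the determinant equals $(1 - at^2) - (A_1^2 - aA_2^2)$ identically.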

In handling the case $\nu \notin S$ the following lemma will be useful. It will be proved at the end
of the section.

Lemma 2. - Let $\nu \in \Omega_{\mathbb{Q}}$ be a finite place, unramified in $K$. Suppose that

\[ \beta = b_1 + b_2\sqrt{a} \in L^* \]

is a unit in $L_\mu$ for each place $\mu$ of $L$ above $\nu$. Then $\beta = \mathbf{N}_{K/L}(\mathbf{x})$ for some $\mathbf{x} \in \mathbb{Q}_\nu^n$, by which
we mean that if $N_1$ and $N_2$ are as in (2.1) then $b_1 = N_1(\mathbf{x})$ and $b_2 = N_2(\mathbf{x})$.

While Lemma 2 is valid for arbitrary extensions $K$ of $L$, we now make the assumption that
$K$ is a quadratic extension of $L$. In particular we have $n = 4$ in the above discussion.
When $\nu \notin S$ our task is to show that if

\[ f(t) := (1 + t\sqrt{a})\gamma(\gamma^{\sigma})^{-1}\delta_0^{-1}, \]

then there exists $t_0^{(\nu)} \in \mathbb{Q}_\nu$ such that $f(t_0^{(\nu)})$ is of the form $\mathbf{N}_{K/L}(\mathbf{x}_0^{(\nu)})$ for some vector
$\mathbf{x}_0^{(\nu)} \in \mathbb{Q}_\nu^4$. Since $\nu \notin S$ there are no weak approximation conditions to be satisfied. We begin
by considering the case in which $\nu \notin S$ is inert in $L/\mathbb{Q}$. Then $v_\nu(\gamma^{\sigma}\gamma^{-1}) = 0$ for any $\gamma$, and
$v_\nu(\delta_0) = 0$ by the choice of $S$, whence $\gamma(\gamma^{\sigma})^{-1}\delta_0^{-1}$ will be a unit in $L_\nu$. Lemma 2 then shows
that there exists $\mathbf{x}_0^{(\nu)} \in \mathbb{Q}_\nu^4$ such that $f(0) = \mathbf{N}_{K/L}(\mathbf{x}_0^{(\nu)})$. It therefore suffices to take $t_0^{(\nu)} = 0$
in (1.7).

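Finally we must deal with the case $\nu \notin S$, with $\nu$ split in $L$. Suppose $\nu$ splits as $\mu_1$ and
$\mu_2 = \mu_1^{\sigma}$ in $L$. Write $p$ for the rational prime associated to $\nu$. Let

\[ v_{\mu_1}(\gamma) = e_1, \quad v_{\mu_2}(\gamma) = e_2. \]

Let $\mathfrak{p}_1$ and $\mathfrak{p}_2 = \mathfrak{p}_1^{\sigma}$ be the prime ideals associated to $\mu_1$ and $\mu_2$ respectively, so that
$(\gamma) = \mathfrak{p}_1^{e_1}\mathfrak{p}_2^{e_2}\mathfrak{q}$ for some ideal $\mathfrak{q}$ coprime to both $\mathfrak{p}_1$ and $\mathfrak{p}_2$. Choose $\alpha_1 \in \mathfrak{p}_1 \setminus (\mathfrak{p}_1^2 \cup \mathfrak{p}_2)$, and set
$\alpha_2 = \alpha_1^{\sigma}$. We write $e = 2 + |e_1| + |e_2|$ and

\[ h_1 = 1 + \tfrac{1}{2}(|e_1| + |e_2| + e_2 - e_1), \quad h_2 = 1 + \tfrac{1}{2}(|e_1| + |e_2| + e_1 - e_2), \]

so that $e$, $h_1$ and $h_2$ are integers, and $e$ is strictly positive. Since $\alpha_i \in L$ and $K/L$ is quadratic,
we have $N_{K/L}(\alpha_i) = \alpha_i^2$. We may then calculate that

\[ v_{\mu_1}(1 + p^{-e}\sqrt{a}) = -e, \quad v_{\mu_1}(\gamma) = e_1, \quad v_{\mu_1}(\gamma^{\sigma}) = e_2, \]

and

\[ v_{\mu_1}(N_{K/L}(\alpha_1^{h_1}\alpha_2^{h_2})) = 2h_1v_{\mu_1}(\alpha_1) + 2h_2v_{\mu_1}(\alpha_2) = 2h_1, \]

whence

\[ v_{\mu_1}(f(p^{-e})N_{K/L}(\alpha_1^{h_1}\alpha_2^{h_2})) = -e + e_1 - e_2 + 2h_1 = 0. \]

Similarly we have

\[ v_{\mu_2}(f(p^{-e})N_{K/L}(\alpha_1^{h_1}\alpha_2^{h_2})) = 0. \]

Thus Lemma 2 tells us that $f(p^{-e})N_{K/L}(\alpha_1^{h_1}\alpha_2^{h_2})$ can be written as $\mathbf{N}_{K/L}(\mathbf{y})$ for some $\mathbf{y} \in \mathbb{Q}_\nu^4$.
It follows that $f(p^{-e})$ takes the form $\mathbf{N}_{K/L}(\mathbf{x}_0^{(\nu)})$ for some $\mathbf{x}_0^{(\nu)} \in \mathbb{Q}_\nu^4$, whence $t_0^{(\nu)} = p^{-e}$ is
acceptable in (1.7). This completes the proof of Lemma 1.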
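
The choice of exponents in the split case is pure bookkeeping with valuations, and can be checked mechanically. The following sketch uses hypothetical function names and simply confirms, for given $e_1, e_2$, that $h_1$ and $h_2$ are integers and that the two valuations of $f(p^{-e})N_{K/L}(\alpha_1^{h_1}\alpha_2^{h_2})$ vanish. It uses the values $v_{\mu_2}(\gamma) = e_2$ and $v_{\mu_2}(\gamma^{\sigma}) = e_1$ for the second place.

```python
def split_case_exponents(e1, e2):
    """Return (e, h1, h2) as defined in the split case of the proof of Lemma 1."""
    e = 2 + abs(e1) + abs(e2)
    s1 = abs(e1) + abs(e2) + e2 - e1
    s2 = abs(e1) + abs(e2) + e1 - e2
    if s1 % 2 or s2 % 2:
        raise ArithmeticError("h1, h2 would not be integers")
    return e, 1 + s1 // 2, 1 + s2 // 2

def valuations_of_product(e1, e2):
    """Valuations at mu_1 and mu_2 of f(p^-e) * N(alpha_1^h1 * alpha_2^h2)."""
    e, h1, h2 = split_case_exponents(e1, e2)
    v1 = -e + e1 - e2 + 2 * h1    # 1 + p^-e sqrt(a), gamma, 1/gamma^sigma, norm factor
    v2 = -e + e2 - e1 + 2 * h2    # the same with the roles of the two places swapped
    return v1, v2
```

For example `split_case_exponents(3, -1)` returns `(6, 1, 5)`, and `valuations_of_product` returns `(0, 0)` for every pair of integers.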

It now remains to establish Lemma 2, for which we return to an arbitrary extension $K$ of
degree $n$ over $\mathbb{Q}$, which contains $L$ as a subfield. It will be convenient to use the notation
$i_\mu$ for the embedding of $L$ into the completion $L_\mu$, and similarly for valuations of $\mathbb{Q}$ and $K$.
Thus our hypothesis is that $i_\mu(\beta)$ is a unit for every $\mu \mid \nu$. For each such $\mu$ we choose a place
$\lambda(\mu) \in \Omega_K$ such that $\lambda(\mu) \mid \mu$. Then the extension of local fields $K_{\lambda(\mu)}/L_\mu$ is unramified,
whence $i_\mu(\beta)$ must be a norm from $K_{\lambda(\mu)}$ (see Gras [13, Corollary 1.4.3, part (ii), page 75],
for example). For each $\mu$ above $\nu$ we may therefore write

\[ i_\mu(\beta) = N_{K_{\lambda(\mu)}/L_\mu}(y_{\lambda(\mu)}) \]

for appropriate elements $y_{\lambda(\mu)} \in K_{\lambda(\mu)}$. If $\lambda \in \Omega_K$ lies above $\mu$ but is different from $\lambda(\mu)$ we
take $y_\lambda = i_\lambda(1)$, so that

\[ i_\mu(1) = N_{K_\lambda/L_\mu}(y_\lambda). \]
We now use weak approximation to find elements $y^{(j)} \in K$ such that

\[ |i_\lambda(y^{(j)}) - y_\lambda|_\lambda < \frac{1}{j} \]

for every $\lambda \in \Omega_K$ above $\nu$. We note in particular that the sequence $(y^{(j)})$ converges with
respect to each valuation $\lambda$ above $\nu$. We now have

\[ \lim_{j \to \infty} N_{K_\lambda/L_\mu}(i_\lambda(y^{(j)})) = \begin{cases} i_\mu(\beta), & \text{if } \lambda = \lambda(\mu), \\ i_\mu(1), & \text{otherwise}, \end{cases} \]

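where the limit is with respect to $|\cdot|_\mu$. We therefore conclude that

\[ \prod_{\lambda \mid \mu} N_{K_\lambda/L_\mu}(i_\lambda(y^{(j)})) \to i_\mu(\beta). \]

However, according to Gras [13, Proposition 2.2, page 93] we have

\[ i_\mu(N_{K/L}(y^{(j)})) = \prod_{\lambda \mid \mu} N_{K_\lambda/L_\mu}(i_\lambda(y^{(j)})), \]

so that

\[ i_\mu(N_{K/L}(y^{(j)})) \to i_\mu(\beta). \]

Since this holds for all $\mu$ above $\nu$ it follows that

\[ N_1(\mathbf{x}^{(j)}) \to b_1 \quad \text{and} \quad N_2(\mathbf{x}^{(j)}) \to b_2, \]

the convergence being with respect to $|\cdot|_\nu$, where

\[ y^{(j)} = x_1^{(j)}\omega_1 + \cdots + x_n^{(j)}\omega_n. \]

Finally, since the sequence $y^{(j)} \in K$ converges for every valuation $\lambda$ above $\nu$, the sequence
$\mathbf{x}^{(j)} \in \mathbb{Q}^n$ must converge in $\mathbb{Q}_\nu^n$, yielding the required vector $\mathbf{x} \in \mathbb{Q}_\nu^n$.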



3. Preliminaries for Theorem 2 

We are now ready to begin the proof of Theorem 2, which concerns the Hasse principle
and weak approximation for the variety (1.5). For the remainder of the paper let $K$ be an
arbitrary number field of degree $n$ that contains $L = \mathbb{Q}(\sqrt{a})$. As noted, it will be convenient
to work with the equivalent variety (1.8). For ease of reference we repeat the definition here:

\[ \mathrm{Tr}_{L/\mathbb{Q}}(\delta\mathbf{N}_{K/L}(\mathbf{y})) = 2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w}) \neq 0. \tag{3.1} \]

We are presented with local solutions $\mathbf{y}^{(\nu)}, \mathbf{w}^{(\nu)}$ for every valuation $\nu$, and wish to find a global
solution which approximates these for every $\nu \in S$. We claim that it suffices to consider the
case in which $\mathbf{y}^{(\nu)}, \mathbf{w}^{(\nu)}$ are integral for every finite $\nu \in S$. Indeed, let us suppose that we can
solve (3.1) with the local conditions

\[ |\mathbf{y} - \mathbf{y}^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{w} - \mathbf{w}^{(\nu)}|_\nu < \varepsilon, \tag{3.2} \]

providing that $\mathbf{y}^{(\nu)}, \mathbf{w}^{(\nu)}$ are integral for all finite $\nu \in S$. Let us suppose further that we have
a general set of values for $\mathbf{y}^{(\nu)}, \mathbf{w}^{(\nu)}$ satisfying (3.1). Then we may choose an integer $N \in \mathbb{N}$
such that $N^2\mathbf{y}^{(\nu)}, N\mathbf{w}^{(\nu)}$ are integral for all finite $\nu \in S$. We note that these values will still
satisfy (3.1), since $\mathbf{N}_{K/L}$ has degree $n/2$ and $\mathbf{N}_{K/\mathbb{Q}}$ has degree $n$. Then, by our assumption,
we can find a global solution $\mathbf{y}, \mathbf{w}$ of (3.1) which satisfies

\[ |\mathbf{y} - N^2\mathbf{y}^{(\nu)}|_\nu < \varepsilon|N^2|_\nu, \quad |\mathbf{w} - N\mathbf{w}^{(\nu)}|_\nu < \varepsilon|N|_\nu \]

for all $\nu \in S$. It follows that $N^{-2}\mathbf{y}, N^{-1}\mathbf{w}$ is a solution of (3.1) fulfilling the condition (3.2).
This establishes our claim.
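
The rescaling step uses only the homogeneity of the two sides of (3.1). As a hedged illustration we take the degenerate toy case $K = L$, so that $n = 2$, the form $\mathbf{N}_{K/L}$ is the identity and $\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w}) = w_1^2 - aw_2^2$. This case is an assumption made purely to keep the sketch short, and the function names are hypothetical.

```python
from fractions import Fraction

def lhs(delta, y, a):
    """Tr_{L/Q}(delta * y) for delta, y in L = Q(sqrt(a)), in the toy case K = L."""
    return 2 * (delta[0] * y[0] + a * delta[1] * y[1])

def rhs(w, a):
    """2 * N_{L/Q}(w), playing the role of 2 * N_{K/Q}(w) when K = L."""
    return 2 * (w[0] * w[0] - a * w[1] * w[1])

def rescale(y, w, N):
    """Replace (y, w) by (N^2 * y, N * w), as in the reduction to integral data."""
    return tuple(N * N * c for c in y), tuple(N * c for c in w)
```

Starting from the rational solution $\mathbf{y} = (-\tfrac{1}{12}, \tfrac{5}{7})$, $\mathbf{w} = (\tfrac12, \tfrac13)$ with $a = 3$ and $\delta = 1$, the choice $N = 84$ produces the integral solution $\mathbf{y} = (-588, 5040)$, $\mathbf{w} = (42, 28)$.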
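
We now use weak approximation for $\mathbb{Z}^n$ to produce vectors $\mathbf{y}^{(M)}, \mathbf{w}^{(M)} \in \mathbb{Z}^n$ such that

\[ |\mathbf{y}^{(M)} - \mathbf{y}^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{w}^{(M)} - \mathbf{w}^{(\nu)}|_\nu < \varepsilon \]

for all finite $\nu \in S$. Thus (3.2) becomes

\[ \mathbf{y} \equiv \mathbf{y}^{(M)} \pmod{M}, \quad \mathbf{w} \equiv \mathbf{w}^{(M)} \pmod{M} \tag{3.3} \]

for an appropriate modulus $M \in \mathbb{N}$.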

Having suitably re-interpreted the weak approximation conditions for the finite places we
turn our attention to the infinite place. Here we use a similar re-scaling argument to conclude
that if $Y, W \in \mathbb{N}$ satisfy

\[ W \equiv 1 \pmod{M} \quad \text{and} \quad Y = W^2, \]

then a solution $\mathbf{y}, \mathbf{w}$ of (3.1) which satisfies both (3.3) and the $\nu = \infty$ constraints

\[ |\mathbf{y} - Y\mathbf{y}^{(\infty)}| < \varepsilon Y, \quad |\mathbf{w} - W\mathbf{w}^{(\infty)}| < \varepsilon W, \tag{3.4} \]

gives rise to a solution $Y^{-1}\mathbf{y}, W^{-1}\mathbf{w}$ of (3.1) which meets the original condition (3.2). Since
$\mathbf{y}^{(\infty)}$ and $\mathbf{w}^{(\infty)}$ cannot vanish we may choose $\varepsilon$ sufficiently small that neither of $\mathbf{y}$ or $\mathbf{w}$ can be
zero in (3.4).

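As in § 1 we replace the vector $\mathbf{y}$ by $\mathbf{u}$ and $\mathbf{v}$ to produce the variety (1.9), whose definition
it is convenient to repeat here:

\[ \mathcal{T} : \quad \mathrm{Tr}(\delta\mathbf{N}_{K/L}(\mathbf{u}), (\mathbf{N}_{K/L}(\mathbf{v})D_L)^{\sigma}) = 2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w}) \neq 0. \tag{3.5} \]

It is clear that if $\mathbf{u}$ and $\mathbf{v}$ are sufficiently close to $\mathbf{u}^{(\nu)} := \mathbf{y}^{(\nu)}$ and

\[ \mathbf{v}^{(\nu)} := (1, 0, 0, \dots, 0) \tag{3.6} \]

in $\mathbb{Q}_\nu^n$, then $\mathbf{y}$ will be suitably close to $\mathbf{y}^{(\nu)}$. We therefore assume that (3.5) has local solutions
$\mathbf{u}^{(\nu)}, \mathbf{v}^{(\nu)}, \mathbf{w}^{(\nu)}$ for all places $\nu$, with $\mathbf{v}^{(\nu)}$ given by (3.6), and we aim to find a global solution
such that

\[ |\mathbf{u} - \mathbf{u}^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{v} - \mathbf{v}^{(\nu)}|_\nu < \varepsilon, \quad |\mathbf{w} - \mathbf{w}^{(\nu)}|_\nu < \varepsilon, \tag{3.7} \]

for $\nu \in S$. Since $\mathbf{u}^{(\nu)}$ and $\mathbf{v}^{(\nu)}$ are integral at all finite $\nu \in S$ we can re-interpret the
corresponding conditions as congruences

\[ \mathbf{u} \equiv \mathbf{u}^{(M)} \pmod{M}, \quad \mathbf{v} \equiv \mathbf{v}^{(M)} \pmod{M} \]

with integer vectors $\mathbf{u}^{(M)}$ and $\mathbf{v}^{(M)}$ for which

\[ \mathbf{v}^{(M)} := (1, 0, 0, \dots, 0). \tag{3.8} \]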

For technical reasons we will move $\mathbf{u}^{(\infty)}$ in (3.5) very slightly, and make a corresponding
adjustment in $\mathbf{w}^{(\infty)}$ to compensate, so as to ensure that

\[ u_i^{(\infty)} \neq 0, \quad (1 \leq i \leq n). \tag{3.9} \]

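For the infinite place we replace the parameter $Y$ by two further values $U$ and $V$ satisfying
$UV = Y$ and impose the conditions

\[ |\mathbf{u} - U\mathbf{u}^{(\infty)}| < \varepsilon U, \quad |\mathbf{v} - V\mathbf{v}^{(\infty)}| < \varepsilon V \]

instead of $|\mathbf{y} - Y\mathbf{y}^{(\infty)}| < \varepsilon Y$.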

We can now summarise our conclusions in the following result. 

Lemma 3. - Suppose we are given local solutions of (3.5) for every valuation $\nu$ of $\mathbb{Q}$, subject
to the condition (3.6). Let $\varepsilon > 0$ also be given. Then there is a modulus $M \in \mathbb{N}$ having $|M|_\nu < 1$
for all finite $\nu \in S$, and a solution $(\mathbf{u}^{(M)}, \mathbf{v}^{(M)}, \mathbf{w}^{(M)})$ of (3.5) over $\mathbb{Z}/M\mathbb{Z}$ satisfying (3.8),
having the following property. Let $V, H_0$ be integer parameters with $H_0 \equiv V \equiv 1 \pmod{M}$.
Let $H = H_0^2$ and suppose that $V \geq H \geq 1$. If

\[ U = HV, \quad W = H^{1/2}V, \]

then any solution $(\mathbf{u}, \mathbf{v}, \mathbf{w}) \in \mathbb{Z}^{3n}$ of (3.5) satisfying

\[ \mathbf{u} \equiv \mathbf{u}^{(M)} \pmod{M}, \quad \mathbf{v} \equiv \mathbf{v}^{(M)} \pmod{M}, \quad \mathbf{w} \equiv \mathbf{w}^{(M)} \pmod{M}, \tag{3.10} \]

and

\[ |\mathbf{u} - U\mathbf{u}^{(\infty)}| < \varepsilon U, \quad |\mathbf{v} - V\mathbf{v}^{(\infty)}| < \varepsilon V, \quad |\mathbf{w} - W\mathbf{w}^{(\infty)}| < \varepsilon W, \tag{3.11} \]

gives rise to a global solution of (3.5) satisfying (3.7). Moreover, for every finite place $\nu \in S$
there is a solution $(\mathbf{u}, \mathbf{v}, \mathbf{w}) \in \mathbb{Z}_\nu^{3n}$ of (3.5) satisfying (3.10).

It might help the reader at this point to say more about the role of the parameters $H$ and
$V$. We shall think of $H$ as being a small fixed power of $V$. When we estimate error terms
in our analysis we cannot afford to lose any power of $V$, unless the exponent can be taken arbitrarily
small. On the other hand there will be certain points in our argument where we will lose
factors of $V^{\eta}$ with arbitrary small $\eta > 0$. This will not matter since we will make a key saving
which is a power of $H$, so that there is a net gain overall.

With this in mind, many of our estimates will involve factors of the type $V^{O(\eta)}$. These
involve the standard convention that there are implicit order constants for each occurrence
of the $O(\cdot)$ notation, which need not be the same on each occasion. Since we are taking
the degree $n$ of $K$ to be fixed, we will allow these implicit order constants to depend on $n$.
Recalling that $H \leq V$ we may replace terms involving any combinations of $H^{\eta}$, $V^{\eta}$ and $W^{\eta}$
by $V^{O(\eta)}$. The number $\eta$ will be a sufficiently small positive constant, which will be fixed
throughout the proof. We could have chosen to specify its value at the outset of the argument,
but we feel it is more instructive merely to impose the condition that $\eta$ is sufficiently small,
at various points in the proof.

We are now ready to cast our problem in terms of a bilinear form. If $R$ is any ring it will
be convenient to write $\mathcal{T}(R)$ for the set of solutions $(\mathbf{u}, \mathbf{v}, \mathbf{w})$ of the equation (3.5) in which
each of the vectors has coordinates lying in $R$. In the light of Lemma 3 we introduce the
counting function

\[ N(H, V) := \#\{(\mathbf{u}, \mathbf{v}, \mathbf{w}) \in \mathcal{T}(\mathbb{Z}) : \text{(3.10) and (3.11) hold}\}. \]

Henceforth we will allow the constants implied by the notations $\gg$ and $O(\cdot)$ to depend on

\[ \mathbf{u}^{(M)}, \mathbf{v}^{(M)}, \mathbf{w}^{(M)}, \mathbf{u}^{(\infty)}, \mathbf{v}^{(\infty)}, \mathbf{w}^{(\infty)}, M, \delta, L, K, \text{ and } \varepsilon, \]

which are to be regarded as fixed once and for all.

In our work we will restrict the values over which $\mathbf{v}$ runs by stipulating that if $\mathbf{N}_{K/L}(\mathbf{v}) =
N_1(\mathbf{v}) + N_2(\mathbf{v})\tau$ then

\[ \gcd(N_1(\mathbf{v}), N_2(\mathbf{v})) = 1. \tag{3.12} \]

In particular it follows that

\[ \gcd(a, b) = O(1) \quad \text{for} \quad (\mathbf{N}_{K/L}(\mathbf{v})D_L)^{\sigma} = a + b\tau. \tag{3.13} \]

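We are now ready to specify the sets over which we will sum. In the case of the variable
$\mathbf{u}$ there is a technical point to be dealt with in § 5. For the time being we give ourselves
independent linear forms $L_1(\mathbf{u}), \ldots, L_n(\mathbf{u})$ whose role will become clear later. Let G be a
further parameter, tending to infinity with V, which we assume is in the range $1 \le G \le H$.
We then define the regions

$$\mathscr{U} := \Big\{\mathbf{u} \in \mathbb{R}^n : \max_{1 \le i \le n} |L_i(\mathbf{u}) - U L_i(\mathbf{u}^{(\mathbb{R})})| < G^{-1}U\Big\},$$
$$\mathscr{V} := \{\mathbf{v} \in \mathbb{R}^n : |\mathbf{v} - V\mathbf{v}^{(\mathbb{R})}| < G^{-1}V\}, \tag{3.14}$$
$$\mathscr{W} := \{\mathbf{w} \in \mathbb{R}^n : |\mathbf{w} - W\mathbf{w}^{(\mathbb{R})}| < G^{-1}W\}.$$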
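In order to interpret our counting function N(H, V) as a bilinear form, we let

$$\alpha(x) := \#\{\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n : \mathbf{u} \equiv \mathbf{u}^{(M)} \pmod{M},\ \delta\mathbf{N}_{K/L}(\mathbf{u}) = x\} \tag{3.15}$$

and

$$\beta(y) := \#\{\mathbf{v} \in \mathscr{V} \cap \mathbb{Z}^n : \mathbf{v} \equiv \mathbf{v}^{(M)} \pmod{M},\ \text{(3.12) holds and } (\mathbf{N}_{K/L}(\mathbf{v})D_L)^{\sigma} = y\} \tag{3.16}$$

for $x, y \in \mathfrak{o}_L$. Lastly we define the function

$$\lambda(l) := \#\{\mathbf{w} \in \mathscr{W} \cap \mathbb{Z}^n : \mathbf{w} \equiv \mathbf{w}^{(M)} \pmod{M},\ 2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w}) = l\}, \tag{3.17}$$

on $\mathbb{Z}$. Notice that $\mathbf{u}^{(\mathbb{R})}$, $\mathbf{v}^{(\mathbb{R})}$ and $\mathbf{w}^{(\mathbb{R})}$ are all non-zero, whence $\mathbf{u}$, $\mathbf{v}$ and $\mathbf{w}$ will be non-zero
throughout $\mathscr{U}$, $\mathscr{V}$ and $\mathscr{W}$ if G is large enough. It follows in particular that $\alpha$, $\beta$ and $\lambda$ are
supported on non-zero $x, y \in \mathfrak{o}_L$, and $l \in \mathbb{Z}$.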
We now define the bilinear form

$$\mathcal{N}(G,H,V) := \sum_{x \in \mathfrak{o}_L} \sum_{y \in \mathfrak{o}_L} \alpha(x)\beta(y)\lambda(\mathrm{Tr}(x,y)).$$

It is easy to check that $N(H,V) \ge \mathcal{N}(G,H,V)$ whenever $G \gg \varepsilon^{-1}$, where the implied
constant is allowed to depend on the linear forms $L_1, \ldots, L_n$, a convention that we adhere to
for the remainder of the paper. It now suffices to demonstrate that $\mathcal{N}(G,H,V) > 0$ for large
values of G. Note that we will ultimately take $G = \log V$.

Although we have framed $\mathcal{N}(G,H,V)$ as a bilinear form, it is not an upper bound for
$\mathcal{N}(G,H,V)$ that we seek but an asymptotic formula as $G \to \infty$. As indicated in the
introduction we will begin by extracting a main term from our expression for $\mathcal{N}(G,H,V)$.
Instrumental in this will be finding a decomposition $\alpha(x) = \hat\alpha(x) + \alpha_0(x)$, for an appropriate
approximation $\hat\alpha(x)$ to $\alpha(x)$. We will then write

$$\mathcal{N}(G,H,V) = \mathcal{M}(G,H,V) + \mathcal{E}(G,H,V), \tag{3.18}$$

where

$$\mathcal{M}(G,H,V) := \sum_{x \in \mathfrak{o}_L} \sum_{y \in \mathfrak{o}_L} \hat\alpha(x)\beta(y)\lambda(\mathrm{Tr}(x,y))$$

is regarded as the main term and

$$\mathcal{E}(G,H,V) := \sum_{x \in \mathfrak{o}_L} \sum_{y \in \mathfrak{o}_L} \alpha_0(x)\beta(y)\lambda(\mathrm{Tr}(x,y))$$

is the error term. The handling of $\mathcal{E}(G,H,V)$ will be executed in § 7 and the estimation of
$\mathcal{M}(G,H,V)$ will be the subject of § 8 and § 9.
Our treatment of $\mathcal{E}(G,H,V)$ requires bounds for

$$\sum_{\substack{x \in R \\ x \equiv x_0 \ (\mathrm{mod}\ h)}} \alpha_0(x)$$

uniformly for small moduli h and square regions R. Thus our approximation $\hat\alpha$ will have
to be such that $\alpha_0$ averages to zero over all congruence classes to small moduli. This will
be achieved via a quite general procedure described in the next section. Our estimate for
$\mathcal{E}(G,H,V)$ will also require bounds for

for which we have the following result. 
Lemma 4. — For any $\eta > 0$ we have

xeoL yeoL 

and 

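Since $|\alpha_0(x)|^2 \le 2|\alpha(x)|^2 + 2|\hat\alpha(x)|^2$ we will also require a bound for

$$\sum_{x \in \mathfrak{o}_L} |\hat\alpha(x)|^2.$$

This will be established in § 5. We conclude this section by proving Lemma 4. We will discuss
the upper bound for the case of $\beta$, the remaining estimates being dealt with similarly. Let
$h(\mathbf{v}) = (\mathbf{N}_{K/L}(\mathbf{v})D_L)^{\sigma}$. We shall show that if $\mathbf{v}'$ is given, then there are $O_{\eta}(V^{\eta})$ choices of $\mathbf{v}$
for which $h(\mathbf{v}) = h(\mathbf{v}')$. We set $\varpi = h(\mathbf{v}')$, so that $\varpi \in \mathfrak{o}_L \setminus \{0\}$. If $\rho = v_1\omega_1 + \cdots + v_n\omega_n \in \mathfrak{o}_K$,
then $\rho \mid \varpi$ in $\mathfrak{o}_K$, and the conjugates of $\rho$ are all $O(V)$ in absolute magnitude. The number
of admissible elements $\rho$ is therefore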

$$O_{\theta}(V^{(1+n)\theta})$$

for any $\theta > 0$. If we now take $\theta = \eta/(1+n)$ it follows that the number of $\mathbf{v}$ corresponding to
a given $\mathbf{v}'$ is $O_{\eta}(V^{\eta})$, as required.

The fact that a given value $\varpi = h(\mathbf{v})$ is attained $O_{\eta}(V^{\eta})$ times will be used at various points
in the rest of the paper without further comment. Similarly, we shall use estimates $O_{\eta}(V^{\eta})$
for the number of representations of $\varpi$ as $\mathbf{N}_{K/L}(\mathbf{u})$ with $\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n$, or of $l$ as $2\mathbf{N}_{K/\mathbb{Q}}(\mathbf{w})$ with
$\mathbf{w} \in \mathscr{W} \cap \mathbb{Z}^n$.

4. A general approximation principle

Our goal now is to split $\alpha(x)$ into two parts $\alpha(x) = \hat\alpha(x) + \alpha_0(x)$, where $\hat\alpha(x)$ is a sufficiently
simple function that we can compute $\mathcal{M}(G,H,V)$ directly. Moreover we will want $\alpha_0(x)$ to
produce sufficient cancellation on average, so that a bilinear form estimation of $\mathcal{E}(G,H,V)$
can be achieved.

We start by describing a general procedure for producing an approximation of the type
$\hat\alpha(x)$. In our context x runs over the ring $\mathfrak{o}_L$, but we will begin by presenting the method
as it applies to sequences indexed by $\mathbb{Z}$, since we hope this will prove to be of independent
interest. The underlying ideas are perhaps not new. In particular there are certain similar
features in recent work of Brüdern [3]. However we have not found anything in the literature
which exactly meets our needs.

We begin by supposing that we are given a sequence $k(1), \ldots, k(N)$ of complex numbers.
We aim to show how to approximate $k(n)$ locally by a function $\hat{k}(n)$. By this we mean that
for any congruence class a (mod q) the sum

$$S(a,q) := \sum_{\substack{n \le N \\ n \equiv a \ (\mathrm{mod}\ q)}} k(n)$$

will be approximated by

$$\hat{S}(a,q) := \sum_{\substack{n \le N \\ n \equiv a \ (\mathrm{mod}\ q)}} \hat{k}(n),$$

at least for small values of q. Naturally we can do this by merely setting $\hat{k}(n) = k(n)$, but
we seek a function $\hat{k}(n)$ which is defined in terms of the density of the sequence $k(n)$ in
congruence classes to small moduli.

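As an example of what we have in mind, consider the sequence $k(n) = \Lambda(n)$, where $\Lambda(n)$
is the von Mangoldt function. If we choose any constant $A \ge 3$ and set

$$\Lambda_Q(n) := \sum_{q \le Q} \frac{\mu(q)}{\varphi(q)} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(-an)$$

with $Q = (\log x)^A$, then

$$\sum_{\substack{n \le x \\ n \equiv b \ (\mathrm{mod}\ h)}} \Lambda(n) = \sum_{\substack{n \le x \\ n \equiv b \ (\mathrm{mod}\ h)}} \Lambda_Q(n) + O_A\big(x(\log x)^{-A/2}\big) \tag{4.1}$$

uniformly for $h \le Q$, for all residue classes b (mod h). This result follows from Heath-Brown
[17, Lemma 1].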

We first introduce our fundamental hypotheses. We assume that we have an arithmetic
function $\rho(a,q)$ and a "smooth" function $\omega(n)$, together with a bound $E \ge 1$ such that

$$|S(a,q) - \rho(a,q)\hat{S}| \le E, \tag{4.2}$$

with

$$\hat{S} := \sum_{n \le N} \omega(n),$$

for all residue classes a to moduli $q \le Q$. We observe that

$$\sum_{\substack{a \ (\mathrm{mod}\ rs) \\ a \equiv b \ (\mathrm{mod}\ r)}} S(a,rs) = S(b,r)$$

for all b, r, s, and we therefore impose the natural condition that

$$\sum_{\substack{a \ (\mathrm{mod}\ rs) \\ a \equiv b \ (\mathrm{mod}\ r)}} \rho(a,rs) = \rho(b,r) \tag{4.3}$$

for all b, r, s. Since we can always re-scale the functions $\rho$ and $\omega$ in (4.2) there is no loss in
generality in assuming that

$$\rho(0,1) = 1. \tag{4.4}$$

Although it is not necessary in general, it will prove convenient to assume that

$$\rho(a,q) \in \mathbb{R}, \quad \rho(a,q) \ge 0$$

for all pairs a, q.

We will also require a smoothness condition on the function $\omega(n)$, which we formulate as
the bound

$$\Big| \sum_{\substack{n \le N \\ n \equiv a \ (\mathrm{mod}\ q)}} \omega(n) - q^{-1}\hat{S} \Big| \le W, \tag{4.5}$$

for all residue classes a to moduli $q \le Q^2$.

Our choice for $\hat{k}(n)$ will be motivated by the treatment of major arcs in the circle method.
If we consider the exponential sum

$$\Sigma(\alpha) := \sum_{n \le N} k(n)e(\alpha n),$$

then when $\alpha$ is close to a/q one would use the major-arc approximation

$$\Big\{ \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q)e_q(ac) \Big\} \Big\{ \sum_{n \le N} \omega(n)e((\alpha - a/q)n) \Big\}.$$

When $\alpha$ is not close to a/q the above expression tends to be small. Hence it is reasonable to
approximate $\Sigma(\alpha)$ by

$$\sum_{q \le Q} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} \Big\{ \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q)e_q(ac) \Big\} \Big\{ \sum_{n \le N} \omega(n)e((\alpha - a/q)n) \Big\}$$

for all real $\alpha$. Picking out the coefficient of $e(\alpha n)$ in the above expression we are therefore
led to suggest the choice

$$\hat{k}(n) := \omega(n) \sum_{q \le Q} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(-an) \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q)e_q(ac). \tag{4.6}$$

We proceed to investigate the sum $\hat{S}(b,h)$ with $h \le Q$. We have

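$$\hat{S}(b,h) = \sum_{\substack{n \le N \\ n \equiv b \ (\mathrm{mod}\ h)}} \omega(n) \sum_{q \le Q} \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q) \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(a(c-n)).$$

The final sum over a (mod q) is a Ramanujan sum, for which the standard evaluation as

$$\sum_{d \mid \gcd(q,\, c-n)} d\mu(q/d)$$

produces

$$\hat{S}(b,h) = \sum_{q \le Q} \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q) \sum_{d \mid q} d\mu(q/d) \sum_{\substack{n \le N \\ n \equiv b \ (\mathrm{mod}\ h) \\ n \equiv c \ (\mathrm{mod}\ d)}} \omega(n).$$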
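The Ramanujan sum evaluation used above can be checked numerically. The following is a minimal sketch, not part of the original argument; the function names are chosen here purely for illustration. It compares the sum of $e_q(am)$ over reduced residues a with the divisor sum $\sum_{d \mid \gcd(q,m)} d\mu(q/d)$.

```python
import cmath
from math import gcd


def mobius(n: int) -> int:
    """Moebius function mu(n), computed by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:  # p^2 divides the original n
                return 0
            result = -result
        p += 1
    if n > 1:  # one remaining prime factor
        result = -result
    return result


def ramanujan_by_definition(q: int, m: int) -> complex:
    """Sum of e_q(a*m) over a (mod q) with gcd(a, q) = 1."""
    return sum(
        cmath.exp(2j * cmath.pi * a * m / q)
        for a in range(1, q + 1)
        if gcd(a, q) == 1
    )


def ramanujan_by_divisors(q: int, m: int) -> int:
    """Sum of d * mu(q/d) over d dividing gcd(q, m)."""
    g = gcd(q, m)
    return sum(d * mobius(q // d) for d in range(1, g + 1) if g % d == 0)
```

In the text the argument m is c - n, so the divisor form is what allows the condition n ≡ c (mod d) to be pulled out of the sum over n.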

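The simultaneous congruences $n \equiv b \pmod{h}$ and $n \equiv c \pmod{d}$ are only soluble if gcd(d,h)
divides c−b, in which case there is a unique solution e (mod [d, h]) say. Thus if $\gcd(d,h) \mid c-b$
our hypothesis (4.5) shows that

$$\hat{S}(b,h) = \mathcal{M} + \mathcal{E},$$

with

$$\mathcal{M} := \hat{S} \sum_{q \le Q} \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q) \sum_{\substack{d \mid q \\ \gcd(d,h) \mid c-b}} \frac{d\mu(q/d)}{[d,h]}$$

and

$$|\mathcal{E}| \le W \sum_{q \le Q} \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q) \sum_{d \mid q} d|\mu(q/d)|.$$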
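The solubility criterion for the pair of congruences can be made concrete. Below is a small sketch under the assumption that all moduli are positive integers; the helper name `solve_congruence_pair` is hypothetical and is introduced only for this illustration. It returns the unique class modulo [d, h] when gcd(d, h) divides c − b, and reports insolubility otherwise.

```python
from math import gcd


def solve_congruence_pair(b: int, h: int, c: int, d: int):
    """Solve n = b (mod h), n = c (mod d).

    Returns (e, m) with m = lcm(d, h), meaning the solutions are exactly
    n = e (mod m), or None when gcd(d, h) does not divide c - b.
    """
    g = gcd(d, h)
    if (c - b) % g != 0:
        return None
    m = d // g * h  # lcm(d, h), written [d, h] in the text
    d_red = d // g
    if d_red == 1:
        t = 0
    else:
        # Write n = b + h*t; then (h/g) * t = (c - b)/g (mod d/g),
        # and h/g is invertible modulo d/g.
        t = ((c - b) // g) * pow(h // g, -1, d_red) % d_red
    return (b + h * t) % m, m
```

For instance, `solve_congruence_pair(1, 4, 3, 6)` corresponds to n ≡ 1 (mod 4), n ≡ 3 (mod 6), where gcd(6, 4) = 2 divides 3 − 1, so a single class modulo 12 results.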

In view of (4.3) and (4.4) we have

$$\sum_{c \ (\mathrm{mod}\ q)} \rho(c,q) = \rho(0,1) = 1,$$

whence the crude bound 

d\q 

yields |<^| ^ WQ=^. 

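Turning to the main term $\mathcal{M}$ we observe in general that

$$\sum_{\substack{d \mid q \\ \gcd(d,h) \mid k}} \frac{d\mu(q/d)}{[d,h]} = h^{-1} \sum_{\substack{d \mid q \\ \gcd(d,h) \mid k}} \mu(q/d)\gcd(d,h)$$

$$= h^{-1} \sum_{d \mid q} \mu(q/d) \sum_{e \mid d,h,k} e \sum_{f \mid d/e,\, h/e} \mu(f)$$

$$= h^{-1} \sum_{e \mid q,h,k} e \sum_{f \mid q/e,\, h/e} \mu(f) \sum_{\substack{d \mid q \\ ef \mid d}} \mu(q/d)$$

$$= h^{-1} \sum_{e \mid q,h,k} e \sum_{f \mid q/e,\, h/e} \mu(f) \sum_{g \mid q/(ef)} \mu(g).$$

The final sum vanishes unless ef = q, in which case we must have $q \mid h$. It then follows that

$$\sum_{\substack{d \mid q \\ \gcd(d,h) \mid k}} \frac{d\mu(q/d)}{[d,h]} = h^{-1} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(ak)$$

when $q \mid h$, the left-hand side being zero otherwise.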

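Inserting this result into our formula for the main term we see that

$$\mathcal{M} = \hat{S}h^{-1} \sum_{q \mid h} \sum_{c \ (\mathrm{mod}\ q)} \rho(c,q) \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(a(c-b)).$$

Using (4.3) we have

$$\rho(c,q) = \sum_{\substack{d \ (\mathrm{mod}\ h) \\ d \equiv c \ (\mathrm{mod}\ q)}} \rho(d,h),$$

whence

$$\mathcal{M} = \hat{S}h^{-1} \sum_{d \ (\mathrm{mod}\ h)} \rho(d,h) \sum_{q \mid h} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(a(d-b)).$$

As q runs over divisors of h, and a runs over residue classes coprime to q, the fractions a/q
run over the entire set

$$\Big\{ \frac{n}{h} : 0 < n \le h \Big\}.$$

We therefore deduce that

$$\mathcal{M} = \hat{S}h^{-1} \sum_{d \ (\mathrm{mod}\ h)} \rho(d,h) \sum_{n \ (\mathrm{mod}\ h)} e_h(n(d-b)) = \hat{S}\rho(b,h),$$

since the summation over n produces the value h when $d \equiv b \pmod{h}$, and the value 0
otherwise.

In view of (4.2) we may now summarise our results as follows.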

Lemma 5. — With the above assumptions we have 

\Sib,h) - S{b,h)\ ^ WQ^ + E 
for all h ^ Q and all residue classes b modulo h. 
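As a sanity check on the construction (4.6), here is a toy example that is not taken from the paper. We take k(n) to be the indicator of n ≡ 1 (mod 3), with the hypothetical choices ω(n) = 1/3 and ρ(c,q) equal to the relative density of the class c (mod q) within that progression. Under these assumptions the approximation reproduces k(n) exactly once Q ≥ 3, since only the moduli q dividing 3 contribute.

```python
import cmath
from math import gcd


def e(q: int, m: int) -> complex:
    """e_q(m) = exp(2*pi*i*m/q)."""
    return cmath.exp(2j * cmath.pi * m / q)


def rho(c: int, q: int) -> float:
    """Relative density of n = c (mod q) among n = 1 (mod 3); rho(0, 1) = 1."""
    g = gcd(q, 3)
    lcm = q * 3 // g
    return 3 / lcm if (c - 1) % g == 0 else 0.0


def k(n: int) -> float:
    """Toy sequence: indicator of n = 1 (mod 3)."""
    return 1.0 if n % 3 == 1 else 0.0


OMEGA = 1 / 3  # hypothetical constant weight omega(n)


def k_hat(n: int, Q: int) -> complex:
    """The approximation (4.6) for the toy data above."""
    total = 0
    for q in range(1, Q + 1):
        for a in range(1, q + 1):
            if gcd(a, q) != 1:
                continue
            for c in range(q):
                total += rho(c, q) * e(q, a * (c - n))
    return OMEGA * total
```

With Q = 2 the modulus 3 is invisible and `k_hat` is just the constant 1/3, which illustrates why the approximation is only claimed for congruence classes to moduli up to Q.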

Thus $\hat{k}(n)$ approximates $k(n)$ well, in congruence classes to small moduli. It may be
instructive to consider the effect of this procedure on the sequence $k(n) = \Lambda(n)$ that we
discussed earlier. Taking

$$\rho(a,q) = \begin{cases} 1/\varphi(q), & \text{if } \gcd(a,q) = 1, \\ 0, & \text{otherwise}, \end{cases}$$

we readily deduce from the Siegel-Walfisz theorem that for any $A \ge 1$ there exists a constant
$c_A > 0$ such that

$$\sum_{\substack{n \le N \\ n \equiv a \ (\mathrm{mod}\ q)}} \Lambda(n) = \rho(a,q)N + O\big(N\exp(-c_A(\log N)^{1/2})\big),$$



QUADRATIC POLYNOMIALS REPRESENTED BY NORM FORMS 






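uniformly for $q \le (\log N)^A$. On taking $\omega(n) = 1$ we see that $E \ll N\exp(-c_A(\log N)^{1/2})$ is
admissible in (4.2). Since $W \ll 1$ in (4.5), the approximation in (4.1) is a trivial consequence
of Lemma 5 with

$$\Lambda_Q(n) = \omega(n) \sum_{q \le Q} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(-an) \sum_{\substack{b \ (\mathrm{mod}\ q) \\ \gcd(b,q)=1}} \frac{e_q(ab)}{\varphi(q)} = \sum_{q \le Q} \frac{\mu(q)}{\varphi(q)} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} e_q(-an)$$

for $Q \le (\log N)^A$.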

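In applications it may be important to know about the size of $\hat{k}(n)$, and we therefore
investigate the mean square

$$\sum_{n \le N} |\hat{k}(n)|^2.$$

If we write

$$c_{a,q} := \sum_{b \ (\mathrm{mod}\ q)} \rho(b,q)e_q(ab)$$

then

$$\hat{k}(n) = \omega(n) \sum_{q \le Q} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} c_{a,q}e_q(-an).$$

Thus, if we assume that

$$|\omega(n)| \le \omega_0$$

for all $n \le N$, then the dual large sieve produces

$$\sum_{n \le N} |\hat{k}(n)|^2 \le \omega_0^2(N + Q^2) \sum_{q \le Q} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} |c_{a,q}|^2.$$

In view of (4.2) we have

$$|\hat{S}c_{a,q} - T(a,q)| \le qE,$$

with

$$T(a,q) := \sum_{b \ (\mathrm{mod}\ q)} S(b,q)e_q(ab) = \sum_{n \le N} k(n)e_q(an).$$

It follows that $|\hat{S}c_{a,q}|^2 \le 2|T(a,q)|^2 + 2q^2E^2$, and hence that

$$|\hat{S}|^2 \sum_{n \le N} |\hat{k}(n)|^2 \le 2\omega_0^2(N + Q^2) \Big( \sum_{q \le Q} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} |T(a,q)|^2 + \sum_{q \le Q} \sum_{\substack{a \ (\mathrm{mod}\ q) \\ \gcd(a,q)=1}} q^2E^2 \Big).$$

We now apply the standard large sieve to deduce that

$$|\hat{S}|^2 \sum_{n \le N} |\hat{k}(n)|^2 \le 2\omega_0^2(N + Q^2) \Big\{ E^2Q^4 + (N + Q^2) \sum_{n \le N} |k(n)|^2 \Big\}.$$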
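The standard large sieve inequality invoked here, in the form with constant N + Q^2, can be probed numerically. The sketch below is only an illustration on random data and is no substitute for the proof; the function name is hypothetical. It evaluates both sides of the inequality for the sums $T(a,q) = \sum_{n \le N} k(n)e_q(an)$.

```python
import cmath
import random
from math import gcd


def large_sieve_sides(k, Q):
    """Return (lhs, rhs) of the large sieve inequality for the sequence k.

    lhs = sum over q <= Q and reduced residues a (mod q) of |T(a, q)|^2,
    rhs = (N + Q^2) * sum of |k(n)|^2, where N = len(k).
    """
    N = len(k)
    lhs = 0.0
    for q in range(1, Q + 1):
        for a in range(1, q + 1):
            if gcd(a, q) != 1:
                continue
            # T(a, q) = sum_{n <= N} k(n) e_q(a n), with k(n) stored at k[n - 1]
            t = sum(k[n - 1] * cmath.exp(2j * cmath.pi * a * n / q)
                    for n in range(1, N + 1))
            lhs += abs(t) ** 2
    rhs = (N + Q * Q) * sum(abs(x) ** 2 for x in k)
    return lhs, rhs
```

A typical use is to draw a random complex sequence with a fixed seed and confirm that the first returned value does not exceed the second.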

We express this result formally in the following lemma. 
Lemma 6. — We have 



Moreover 



2 



providing that EQ ^ A?^. 

Thus under suitable circumstances the $L^2$-norm of $\hat{k}(n)$ will have order of magnitude
bounded by the $L^2$-norm of $k(n)$.

Having described the situation for sequences $k(n)$ indexed by $\mathbb{Z}$ we return to our original
problem, in which we have a sequence $\alpha(x)$ with x running over $\mathfrak{o}_L$. We will describe the
situation as it applies to a quite general function $\alpha$, and only later, in § 5, restrict to the
function (3.15). We shall consider sums in which x runs over a region R, say, and lies in a
congruence class y (mod q), where $y \in \mathfrak{o}_L$ and $q \in \mathbb{N}$. We therefore set

$$S(y,q) := \sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \alpha(x) \tag{4.7}$$

and

$$\hat{S}(y,q) := \sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \hat\alpha(x),$$

and assume that we have functions $\rho(y,q)$ and $\omega(x)$ such that $\rho(y,q)$ is non-negative and

$$|S(y,q) - \rho(y,q)\hat{S}| \le E \tag{4.8}$$

for $q \le Q$, with $E \ge 1$ and

$$\hat{S} := \sum_{x \in R} \omega(x).$$

As before we will assume that

$$\sum_{\substack{y \ (\mathrm{mod}\ rs) \\ y \equiv z \ (\mathrm{mod}\ r)}} \rho(y,rs) = \rho(z,r) \tag{4.9}$$

and

$$\rho(0,1) = 1. \tag{4.10}$$

Finally, we shall suppose that

$$\Big| \sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \omega(x) - q^{-2}\hat{S} \Big| \le W \tag{4.11}$$

for all residue classes y to moduli $q \le Q^2$.

In order to define $\hat\alpha(x)$ we will need an analogue of the coprimality condition gcd(a, q) = 1
which occurs in (4.6). It turns out that the correct generalisation is to require that there is
no non-trivial common divisor of y and q in $\mathbb{N}$. Of course this is not the same as requiring
gcd(y, q) = 1 in $\mathfrak{o}_L$. We write

$$\sideset{}{^*}\sum_{y \ (\mathrm{mod}\ q)}$$

to denote a sum in which $y \in \mathfrak{o}_L$ runs over residue classes modulo q such that y and q have
no non-trivial common divisor in $\mathbb{N}$.

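We also need an analogue for the exponential function $e_q(a)$. Recall that $\{1, \tau\}$ is a $\mathbb{Z}$-basis
for $\mathfrak{o}_L$, and hence also a $\mathbb{Q}$-basis for L. If $x = a + b\tau \in L$, with $a, b \in \mathbb{Q}$, and if $q \in \mathbb{N}$, we
define

$$e^{(L)}(x) := e(b) \quad \text{and} \quad e_q^{(L)}(x) := e_q(b).$$

These exponential functions have the property that if $y \in \mathfrak{o}_L$ then

$$\sum_{x \ (\mathrm{mod}\ q)} e_q^{(L)}(xy) = \begin{cases} q^2, & \text{if } q \mid y, \\ 0, & \text{if } q \nmid y. \end{cases} \tag{4.12}$$

Thus, for the analogue of the Ramanujan sum we have

$$\sideset{}{^*}\sum_{x \ (\mathrm{mod}\ q)} e_q^{(L)}(xy) = \sum_{d \mid \gcd(q,y)} d^2\mu(q/d). \tag{4.13}$$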
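Both (4.12) and (4.13) can be tested by brute force for small moduli. The sketch below assumes, purely for illustration, that τ satisfies τ² = τ + 1 (the constants `TR` and `TS` are hypothetical and any integer relation τ² = TR·τ + TS could be substituted); elements a + bτ are stored as coefficient pairs, and gcd(q, y) is read as the largest natural number dividing q and both coefficients of y.

```python
import cmath
from math import gcd

# Hypothetical choice for illustration: tau^2 = TR * tau + TS.
TR, TS = 1, 1


def mobius(n: int) -> int:
    """Moebius function mu(n), computed by trial division."""
    result = 1
    p = 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    if n > 1:
        result = -result
    return result


def multiply(x, y):
    """Product of x = a + b*tau and y = c + d*tau, as a coefficient pair."""
    a, b = x
    c, d = y
    return (a * c + b * d * TS, a * d + b * c + b * d * TR)


def e_q_L(q, x):
    """e_q^{(L)}(x) = e(b/q) for x = a + b*tau."""
    return cmath.exp(2j * cmath.pi * x[1] / q)


def complete_sum(q, y):
    """Left-hand side of (4.12): sum over all x (mod q)."""
    return sum(e_q_L(q, multiply((a, b), y)) for a in range(q) for b in range(q))


def restricted_sum(q, y):
    """Left-hand side of (4.13): x and q share no non-trivial divisor in N."""
    return sum(
        e_q_L(q, multiply((a, b), y))
        for a in range(q) for b in range(q)
        if gcd(gcd(a, b), q) == 1
    )


def divisor_side(q, y):
    """Right-hand side of (4.13): sum of d^2 * mu(q/d) over d | gcd(q, y)."""
    g = gcd(gcd(y[0], y[1]), q)
    return sum(d * d * mobius(q // d) for d in range(1, g + 1) if g % d == 0)
```

The restricted sum follows from the complete one by Möbius inversion over the common divisor of x and q, which is the same mechanism as for the classical Ramanujan sum.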

We are now ready to specify our approximation to $\alpha(x)$. We set

$$\hat\alpha(x) := \omega(x) \sum_{q \le Q} \ \sideset{}{^*}\sum_{y \ (\mathrm{mod}\ q)} e_q^{(L)}(-yx) \sum_{z \ (\mathrm{mod}\ q)} \rho(z,q)e_q^{(L)}(yz).$$

An argument precisely analogous to that used for Lemma 5 then produces the following result.
Lemma 7. — With the above assumptions we have 

\S{z,h) - S{z,h)\ ^ WQ^ + E 
for all h ^ Q and all residue classes z modulo h. 

To produce an analogue of Lemma 6 we shall assume that

$$R = \{a + b\tau \in \mathfrak{o}_L : |a - a_0|, |b - b_0| < N\}$$

for certain $a_0, b_0$. Then we will have a large sieve inequality for $\mathfrak{o}_L$, taking the form

$$\sum_{q \le Q} \ \sideset{}{^*}\sum_{y \ (\mathrm{mod}\ q)} \Big| \sum_{x \in R} c_x e_q^{(L)}(xy) \Big|^2 \le (\sqrt{2N} + Q)^4 \sum_{x \in R} |c_x|^2. \tag{4.14}$$

This follows from the two dimensional large sieve of Huxley [20, Theorem 1]. Under the
condition that $Q^2 \le N$, which we now impose, we will then have $(\sqrt{2N} + Q)^4 \ll N^2$. We
proceed to argue as before to deduce from the dual of the above estimate that

with 

z (mod q) 

This time the estimate (4.8) yields

$$|\hat{S}c_{y,q} - T(y,q)| \le q^2E,$$

with

$$T(y,q) = \sum_{x \in R} \alpha(x)e_q^{(L)}(xy).$$
Continuing as before we then obtain 

x&R \ xeR / 

This gives us the following conclusion. 

Lemma 8. — Suppose that

$$R = \{a + b\tau \in \mathfrak{o}_L : |a - a_0|, |b - b_0| < N\}$$

for certain $a_0, b_0$, and assume that there is a constant $\omega_0$ such that $|\omega(x)| \le \omega_0$ for all $x \in R$.
Then

providing that Q"^ ^ N and E'^Q'^ ^ A^''. 



5. The functions $\hat\alpha(x)$ and $\alpha_0(x)$

In this section we will apply Lemmas 7 and 8 to the function (3.15). It will be important
for us to produce functions $\rho$ and $\omega$ which depend only on the set $\mathscr{U}$, and not on the set R,
since we require results which are uniform in R.

We will write $x \in \mathfrak{o}_L$ in the form $x = x_1 + x_2\tau$ with $x_1, x_2 \in \mathbb{Z}$. We shall also write $\delta\mathbf{N}_{K/L}(\mathbf{u})$
as $N_1(\mathbf{u}) + N_2(\mathbf{u})\tau$. The reader should observe that this notation does not coincide with that
used temporarily in (2.1), nor that used in our discussion of (3.12). We let R be a square
in the $(x_1, x_2)$-plane, with sides parallel to the $x_1$ and $x_2$ axes, and side-length $\rho \ll U^{n/2}$.
This corresponds, by abuse of notation, to the square R used in defining the sums S(y, q) in
(4.7). Extending this abuse of notation we shall allow $\delta\mathbf{N}_{K/L}(\mathbf{u})$ to denote the ordered pair
$(N_1(\mathbf{u}), N_2(\mathbf{u}))$.

We will specify the function $\omega(x) = \omega(x_1, x_2)$ and verify its properties at the end of this
section. For the time being we content ourselves with describing the key features as follows.

Lemma 9. — There is a continuously differentiable function $\omega : \mathbb{R}^2 \to [0,\infty)$, depending on
$\mathscr{U}$, for which $\omega(\mathbf{x} + \mathbf{h}) - \omega(\mathbf{x}) \ll U^{-n/2}|\mathbf{h}|$ for all $\mathbf{x}, \mathbf{h} \in \mathbb{R}^2$, and such that

$$\int_R \omega(x_1, x_2)\, dx_1\, dx_2 = \mathrm{meas}\{\mathbf{u} \in \mathscr{U} : \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R\}, \tag{5.1}$$

for every square R as above. Furthermore $\omega$ is supported on a disc of radius $O(U^{n/2})$ and
satisfies $\omega(\mathbf{x}) \ll 1$ throughout this disc. Moreover there is a disc of radius $\gg G^{-1}U^{n/2}$ around
the point $U^{n/2}\delta\mathbf{N}_{K/L}(\mathbf{u}^{(\mathbb{R})})$ on which we have $\omega(\mathbf{x}) \gg G^{2-n}$.

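Although it is a real measure which occurs on the right-hand side of (5.1), what occurs
naturally for us is the corresponding cardinality

$$S_0 := \#\{\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n : \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R\}.$$

We proceed to establish a relation between the two. For each $\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n$ we let

$$S(\mathbf{u}) := \{\mathbf{y} \in \mathbb{R}^n : u_i \le y_i < u_i + 1 \ (1 \le i \le n)\}$$

and

$$\mathscr{U}^{(-)} := \bigcup \{S(\mathbf{u}) : \mathbf{u} \in \mathbb{Z}^n,\ S(\mathbf{u}) \subseteq \mathscr{U}\}.$$

Thus $\mathscr{U}^{(-)} \subseteq \mathscr{U}$ and the number of integer vectors in $\mathscr{U}$ but not $\mathscr{U}^{(-)}$ is $O(U^{n-1})$. In
particular,

$$S_0 = \#\{\mathbf{u} \in \mathscr{U}^{(-)} \cap \mathbb{Z}^n : \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R\} + O(U^{n-1}). \tag{5.2}$$

Now if $\mathbf{u} \in \mathscr{U}^{(-)} \cap \mathbb{Z}^n$ and $\mathbf{y} \in S(\mathbf{u})$ then $\delta\mathbf{N}_{K/L}(\mathbf{y}) = \delta\mathbf{N}_{K/L}(\mathbf{u}) + O(U^{n/2-1})$. Thus there
are squares $R_1$ and $R_2$ of sides $\rho_1$ and $\rho_2$ respectively, such that

$$|\rho_1 - \rho| \ll U^{n/2-1} \quad \text{and} \quad |\rho_2 - \rho| \ll U^{n/2-1},$$

and for which

$$\delta\mathbf{N}_{K/L}(\mathbf{y}) \in R_1 \ \Rightarrow\ \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R \ \Rightarrow\ \delta\mathbf{N}_{K/L}(\mathbf{y}) \in R_2$$

whenever $\mathbf{y} \in S(\mathbf{u})$. We deduce that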

#{u G n Z'* : (5N^/l(u) g ii:} 

^ meas{y G ^^'^ : dN^/Liy) e 

and 

#{u G n Z" : (5N;^/l(u) G R} 

^ meas{y G ^^^-^ : SNk/M e 
However '^('"^ is contained in and differs from it by a set of measure 0{U'^~^), whence 

meas{y G ^^^^ : <5N^/L(y) G Ri} 

^ meas{y G ^ : <5N^/L(y) G Ri} + 0(C/"-i) 

and 

meas{y G -^("^ : (5N;^/L(y) G R2} ^ meas{y G ^ : SNx/Liy) G ^2}- 
According to (5.1) we may then deduce that 

#{u G n Z" : 6'Nk/l{u) eR}^ / co{xi,X2) dxi dx2 + 0(C/"-^) 

jRi 
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and 

#{u G '^r^-) n Z" : ^N;f/i,(u) G i?} ^ M"'' / a;(xi, X2) dxi dxs- 

However the squares Ri and R2 each differ from i? by a set of measure 0(?7"~^), and fur- 
thermore a; ^ 1. Thus 

#{u G n : (5N;^/i(u) G i?} = M"'* / a;(xi, X2) dxi dxs + 0(C/"-^). 

JR 

Finally, if we compare this with (5.2), we deduce that 

So = M-'' [ io{xi,X2)dxidx2 + 0{U''-^). (5.3) 

JR 

We next consider 

x=y (mod q) 

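To each point $\mathbf{x}$ counted in the above sum we associate the square $R(\mathbf{x}) = [x_1, x_1 + q) \times
[x_2, x_2 + q)$. If $\mathbf{t} \in R(\mathbf{x})$ then $\omega(\mathbf{t}) - \omega(\mathbf{x}) \ll qU^{-n/2}$ by Lemma 9, whence

$$\int_{R(\mathbf{x})} \omega(\mathbf{t})\, dt_1\, dt_2 = q^2\omega(\mathbf{x}) + O(q^3U^{-n/2}).$$

We sum this for points $\mathbf{x} \in R$ with $\mathbf{x} \equiv \mathbf{y} \pmod{q}$. Since there are $O(q^{-2}U^n)$ such points if
$q \le U^{n/2}$ we deduce that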



V a;(x) = g-2 Y I a;(t)dtidt2 + 0((zC/"/'^ 



xG-R xg_R 
x=y (mod q) x=y (mod q) 



The union of the squares -R(x) will be a square R' say, whose sides are within a distance g 
of the sides of R. Thus R and i?' differ by a set of measure 

0(gC/"/2), if ^ ^ C/n/2_ Since 

a;(t) ^ 1 for all t this produces 



V w(x) = f ^(t) dti dt2 + 0{qU''/^^ 

~r> JR! 



xei? 

x=y (mod q) 



= q-^ [ a;(t) dh dt2 + 0(g[/"/2). 
Comparing this with (5.3) we therefore obtain 



n— 1^ 



x=y (mod q) 

providing that n ^ 4 and q ^ U. In particular, when g = 1 we obtain 

S = M"So + 0([/""^), (5.4) 

so that (4.11) holds with W < U''-^ and any Q ^ [7^/2_ 

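We next consider the condition (4.8). In view of the definition (3.15) we see that

$$S(y,q) = \sum_{\mathbf{w}} \#\{\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n : \mathbf{u} \equiv \mathbf{w} \pmod{[M,q]},\ \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R\},$$

where $\mathbf{w}$ runs over vectors modulo [M, q] for which $\mathbf{w} \equiv \mathbf{u}^{(M)} \pmod{M}$ and $\delta\mathbf{N}_{K/L}(\mathbf{w}) \equiv
y \pmod{q}$. For a general modulus r we proceed to compare the sizes of the sets

$$T(r,\mathbf{x}) := \{\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n : \mathbf{u} \equiv \mathbf{x} \pmod{r},\ \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R\}$$

for the values $\mathbf{x} = \mathbf{w}$ and $\mathbf{0}$. For $\mathbf{u} = (u_1, \ldots, u_n)$ let

$$\mathbf{u}^* = (r[u_1/r], \ldots, r[u_n/r]).$$

If $\mathbf{u} \in T(r, \mathbf{w})$ then $\mathbf{u}^*$ will belong to $T(r, \mathbf{0})$ unless either $\mathbf{u}$ is within a distance O(r) of the
boundary of $\mathscr{U}$, or $\delta\mathbf{N}_{K/L}(\mathbf{u})$ is within a distance $O(rU^{n/2-1})$ of the boundary of R. A pair
$(N_1, N_2)$ arises at most $O_{\eta}(U^{\eta})$ times as a value of $\delta\mathbf{N}_{K/L}(\mathbf{u})$ with $\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n$, and hence it
follows that

$$\#T(r, \mathbf{0}) = \#T(r, \mathbf{w}) + O_{\eta}(rU^{n-1+\eta}) \tag{5.5}$$

for any $\eta > 0$. If we now sum for all $\mathbf{w}$ (mod r) we deduce that

$$r^n\#T(r, \mathbf{0}) = \#\{\mathbf{u} \in \mathscr{U} \cap \mathbb{Z}^n : \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R\} + O_{\eta}(r^{n+1}U^{n-1+\eta})$$

and hence that

$$r^n\#T(r, \mathbf{0}) = S_0 + O_{\eta}(r^{n+1}U^{n-1+\eta}).$$

Substituting this back into (5.5) we deduce that

$$\#T(r, \mathbf{w}) = r^{-n}S_0 + O_{\eta}(rU^{n-1+\eta}).$$

We therefore see that

$$S(y,q) = M^{-n}\rho(y,q)S_0 + O_{\eta}(q^{n+1}U^{n-1+\eta}),$$

with

$$\rho(y,q) := \frac{M^n}{[M,q]^n} \#\Big\{\mathbf{s} \ (\mathrm{mod}\ [M,q]) : \mathbf{s} \equiv \mathbf{u}^{(M)} \pmod{M},\ \delta\mathbf{N}_{K/L}(\mathbf{s}) \equiv y \pmod{q}\Big\}. \tag{5.6}$$

The conditions (4.9) and (4.10) are now readily checked, and we see that (4.8) follows from
(5.4) with $E \ll_{\eta} Q^{n+1}U^{n-1+\eta}$.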

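We are now ready to apply Lemma 7, which produces the following result.

Lemma 10. — Let R be a square with side $\rho \ge 1$ satisfying $\rho \ll U^{n/2}$. Then if $\eta$ is any
positive constant we have

$$\sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \alpha_0(x) \ll_{\eta} Q^{n+1}U^{n-1+\eta}$$

for all $q \le Q \le U^{1/2}$ and all y (mod q).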

In our application the square R will vary and so it is crucial that Lemma 10 is uniform in
squares of side length $O(U^{n/2})$. As to Lemma 8, we will only need a result for the $L^2$-norm
taken over all x. Hence we choose R to be a square centred at the origin and with side
$\rho = cU^{n/2}$, in which the constant c is taken sufficiently large that $x \in R$ whenever $\alpha(x) \ne 0$.
We may clearly take $\omega_0 \ll 1$ by Lemma 9 and

$$\sum_{x \in \mathfrak{o}_L} |\alpha(x)|^2 \ll_{\eta} U^{n+\eta}$$

by Lemma 4. Our remaining task is thus to estimate $\hat{S}$ from below. However our choice of
R ensures that $S_0 = \#(\mathscr{U} \cap \mathbb{Z}^n) \gg U^nG^{-n}$, whence (5.4) yields $\hat{S} \gg U^nG^{-n}$, assuming that
$G \le U^{1/(n+1)}$, say. Consequently we deduce from Lemma 8 the following bound.
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Lemma 11. — For any constant rj > we have 

providing that Q^~^^ ^ U^~'^ and G ^ U^/(^+^) , Moreover we then have 

xeoL 

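The final estimate follows immediately from Lemma 4, since

$$\sum_{x \in \mathfrak{o}_L} |\alpha_0(x)|^2 \le 2\sum_{x \in \mathfrak{o}_L} |\alpha(x)|^2 + 2\sum_{x \in \mathfrak{o}_L} |\hat\alpha(x)|^2.$$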

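We end this section by proving Lemma 9. We first show that the map from $\mathbb{C}^n$ to $\mathbb{C}^2$ given
by $\mathbf{u} \mapsto (N_1(\mathbf{u}), N_2(\mathbf{u}))$ is non-singular at any point for which $\mathbf{N}_{K/\mathbb{Q}}(\mathbf{u}) \ne 0$, by which we
mean that $\nabla N_1(\mathbf{u})$ and $\nabla N_2(\mathbf{u})$ are only proportional if $\mathbf{N}_{K/\mathbb{Q}}(\mathbf{u}) = 0$. Clearly it suffices to
do the same for the map

$$\mathbf{u} \mapsto (N_1(\mathbf{u}) + \tau N_2(\mathbf{u}),\ N_1(\mathbf{u}) + \tau^{\sigma}N_2(\mathbf{u})).$$

However $N_1(\mathbf{u}) + \tau N_2(\mathbf{u})$ (respectively, $N_1(\mathbf{u}) + \tau^{\sigma}N_2(\mathbf{u})$) is a product of $\delta$ (respectively, $\delta^{\sigma}$)
with certain conjugates of $u_1\omega_1 + \cdots + u_n\omega_n$. Thus we can write our map in the form

$$\mathbf{u} \mapsto (\Lambda_1(\mathbf{u}) \cdots \Lambda_{n/2}(\mathbf{u}),\ \Lambda_{n/2+1}(\mathbf{u}) \cdots \Lambda_n(\mathbf{u})),$$

with suitable linear forms $\Lambda_i$, which will be linearly independent. Hence if we set $v_i = \Lambda_i(\mathbf{u})$
for $1 \le i \le n$ it will suffice to show that the map

$$\mathbf{v} \mapsto (v_1 \cdots v_{n/2},\ v_{n/2+1} \cdots v_n)$$

is non-singular whenever $v_1 \cdots v_n \ne 0$. This however is trivial.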

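Since $\mathbf{N}_{K/\mathbb{Q}}(\mathbf{u}^{(\mathbb{R})}) \ne 0$, we deduce that there are indices i and j such that

$$\frac{\partial N_1(\mathbf{u}^{(\mathbb{R})})}{\partial u_i} \frac{\partial N_2(\mathbf{u}^{(\mathbb{R})})}{\partial u_j} \ne \frac{\partial N_1(\mathbf{u}^{(\mathbb{R})})}{\partial u_j} \frac{\partial N_2(\mathbf{u}^{(\mathbb{R})})}{\partial u_i}.$$

We suppose for notational simplicity that i = 1 and j = 2, the other cases being similar. By
continuity we then have

$$|J(\mathbf{u})| \gg U^{n-2}, \quad \text{where} \quad J(\mathbf{u}) := \frac{\partial N_1}{\partial u_1} \frac{\partial N_2}{\partial u_2} - \frac{\partial N_1}{\partial u_2} \frac{\partial N_2}{\partial u_1}, \tag{5.7}$$

throughout $\mathscr{U}$, if G is large enough. We now write $\mathbf{v} = (u_1, u_2)$ and $\mathbf{u} = (\mathbf{v}, \mathbf{w})$, where
$\mathbf{w} = (u_3, \ldots, u_n)$. We will find it convenient to number the entries in $\mathbf{w}$ as $(w_3, \ldots, w_n)$. By
the Implicit Function Theorem, if

$$N_1(\mathbf{v}, \mathbf{w}) = x_1, \quad N_2(\mathbf{v}, \mathbf{w}) = x_2$$

with $(\mathbf{v}, \mathbf{w}) \in \mathscr{U}$ we can express $\mathbf{v}$ in terms of $\mathbf{x}$ and $\mathbf{w}$ as $\mathbf{v} = \mathbf{v}(\mathbf{x}, \mathbf{w})$.
We may now calculate that

$$\mathrm{meas}\{\mathbf{u} \in \mathscr{U} : \delta\mathbf{N}_{K/L}(\mathbf{u}) \in R\} = \int\!\!\int_R \int_{\mathbf{w} \in W(\mathbf{x})} |J(\mathbf{v}(\mathbf{x}, \mathbf{w}), \mathbf{w})|^{-1}\, dw_3 \cdots dw_n\, dx_1\, dx_2,$$

where

$$W(\mathbf{x}) := \{\mathbf{w} \in \mathbb{R}^{n-2} : (\mathbf{v}(\mathbf{x}, \mathbf{w}), \mathbf{w}) \in \mathscr{U}\}.$$

We therefore define

$$\omega(\mathbf{x}) := \int_{\mathbf{w} \in W(\mathbf{x})} |J(\mathbf{v}(\mathbf{x}, \mathbf{w}), \mathbf{w})|^{-1}\, dw_3 \cdots dw_n, \tag{5.8}$$

so that (5.1) immediately follows. It is clear from the definition of $W(\mathbf{x})$ that $\omega$ is supported
on the set of values $\delta\mathbf{N}_{K/L}(\mathbf{u})$ as $\mathbf{u}$ runs over $\mathscr{U}$. Thus the support is contained in a disc of
radius $O(U^{n/2})$ as claimed in the lemma. Moreover

$$\mathrm{meas}\{W(\mathbf{x})\} \le \mathrm{meas}\{\mathbf{w} \in \mathbb{R}^{n-2} : (\mathbf{v}, \mathbf{w}) \in \mathscr{U} \text{ for some } \mathbf{v}\} \ll U^{n-2}, \tag{5.9}$$

whence (5.7) yields $\omega(\mathbf{x}) \ll 1$ as required.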

The function v(x, w) will be continuously differentiable with respect to x and w. Moreover 
it will be weighted-homogeneous of degree 1 in w and degree 2/n in x. Since dxk/dvi <^ 
ljn/2-i for 1 ^ fc, Z ^ 2 we deduce from (5.7) that 

5^ < J(u)-^C/"/2-^ < C/^-"/^ il^k,l^2). (5.10) 

OXi 

We observe from (3.9) that d]>iK/Q{u^^^)/dws does not vanish, whence 

for at least one of j = 1 or j = 2. Moreover 

aNj(v(x, w),w) ^ dxj_ ^ ^ 

for j = 1, 2, whence 

dNj dvi 9Nj dv2 _Q (■ -12) 

dv\ dws dv2 dws dws ' ' 

Thus at least one of dvi/dws and dv2/dw3 is non- vanishing at u^'*). We suppose that 
dvi/dw3 7^ 0, the alternative case being similar. By continuity we then have {dvi/dw^l ^ 1 
for |u — u(^)| <C G~^, if G is large enough. It follows that 

dvi 



dw?. 



» 1 (5.11) 



throughout ^ since the partial derivative is homogeneous in w, of weight zero. 
We can now investigate a;(x + h) — a;(x). On l^(x + h) fl PF(x) we have 

J(v(x + h, w), w)~"^— J(v(x, w), w)~"^ 

< J(v(x + h, w), w) - J(v(x, w), w)|, 

by (5.7). Moreover 

J(v(x + h,w),w) - J(v(x,w),w) < C/''~^|v(x + h,w) - v(x,w)|, 

since J(u) is a form in u of degree n — 2. It then follows from (5.10) that 

J(v(x + h, w), w)-^ - J(v(x, w), w)-^ < C/2-3'»/2|h| 

on VF(x + h) n W(x). We therefore see from (5.9) that the contribution to a;(x + h) — a;(x) 
from the set VF(x + h) n W^(x) will be ©([/""/^Ihl). This is satisfactory. 
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On the remaining range we merely use the bound J^^ <^ C/^^". It therefore suffices to 
estimate the measure of the set of w G M"~^ for which either (v(x + h, w),w) G ^ or 
(v(x, w), w) G "W, but not both. By substituting x' = x + h and h' = — h if necessary, 
we may suppose that (v(x, w),w) G ^ and (v(x + h, w),w) ^ . In view of (5.10) this 
means that (v(x, w),w) hes in at a distance 0{U^~^^'^\h.\) from the boundary of Let 
'^x(h) denote the set of such points w. As yet we have not completely specified the set ^ 
in (3.14), and it is now time to do so. For indices i ^ 3 we merely choose Li{u) = wi. Then 
if (v(x, w), w) lies in at a distance 0(L'"^~"/^|h|) from the edge defined by Lj we see that 
the corresponding wi runs over an interval of length 0(C/^^"/^|h|), so that the contribution 
to meas(^x(h)) is <C J7-'^~"/^|h|J7"'~^ = t/"/^"^|h|. We take the remaining linear forms to 
be v\ and v\ + \v2- Here A is a non-zero constant chosen sufficiently small that 



d(vi + \V2) 



> 1 



throughout ^ . In view of (5.11) such a choice will be possible, since we have dv2/dws <C 1. 
Suppose now that W/i,W5,...,Wn are fixed. Then if (v(x, w),w) lies in at a distance 
0(U^~^/'^\h.\) from the edge defined by the linear form Lj = vi we see that ^3 is confined 
to an interval of length 0(?7^~"/^|h|), and similarly for the edge defined by = vi + \v2- 
It follows that the contribution to meas 

<^ f/^^"- we deduce that w(x + h) — t^(x) ^ [/^"/^|h| as required. 
It remains to establish the lower bound for w(x). It is clear that J(u) is homogeneous of 
degree n — 2, whence J(u) ^ C/""^ for all relevant u. In view of (5.8) it therefore suffices to 
show that meas{PF(x)} S> on a suitable set of values x. Now 

(v(5N^/,(uW),nf , . . . ,«f )) , uf\ . . . ,«f )) = u«. 

Moreover if |x| <C [7*^/^ and |w| ^ U then 

v(x, w) - V (c/-/2<5N^/^(uW), Uuf\. . . , Uuf^) 

< t/||i7-"/2x-5N;^/i(uW)| + max |C/-V -«f^|l , 

by the homogeneity properties of v(x, w). We therefore see from the definition (3.14) of 
^ that there is a small constant c > such that (v(x, w),w) G ^ whenever we have 

\wi — Uuf^\ ^ cG^^U for 3 ^ i ^ n and 

|x - C/"/2<5N^/^(uW)| cG-^C/'^/^ 
We therefore have meas{VF(x)} ^ Q2-njjn-2 -j^ ^^le above region, as required. 



6. A large sieve bound for $\alpha_0(x)$

From Lemma 10 we know that $\alpha_0(x)$ is evenly distributed in all congruence classes for
moduli $q \le Q$. The condition that $Q \le U^{1/2}$ causes no problems. However a more serious
constraint on the size of Q comes from the fact that we cannot handle $\hat\alpha$ if Q is too large.

The goal of the present section is to show that in fact $\alpha_0$ is evenly distributed for "almost
all" congruence classes, for much larger values of q. This will enable us to get equidistribution
for large moduli, on average, while keeping Q sufficiently small that $\hat\alpha$ can be adequately
handled.

Our equidistribution result will be achieved by a quite general large sieve argument,
motivated by (but, we believe, simpler than) that used by Fouvry and Iwaniec [12, § 10]. A related
procedure is given by Iwaniec and Kowalski [21, Theorem 17.5]. However we have found it
more convenient to use additive characters directly, rather than to switch to multiplicative
ones as they do. The reader might care to note that a somewhat different argument, also of
a very general kind, appears in Heath-Brown [17, § 2].

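We assume that we have a function $\alpha_0(x)$ defined on $\mathfrak{o}_L$, satisfying

$$\Big| \sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \alpha_0(x) \Big| \le W_0, \tag{6.1}$$

for all $q \le Q$ and $y \in \mathfrak{o}_L$, where R is a square of side 2N as in Lemma 8. For the duration of
this section let us write

$$\Sigma(S; q, y) := \sum_{\substack{x \in S \\ x \equiv y \ (\mathrm{mod}\ q)}} \alpha_0(x)$$

for any square S. We proceed to consider

$$S_1(Q_0) := \sum_{q \le Q_0} q^2 \sum_{y \ (\mathrm{mod}\ q)} |\Sigma(R; q, y)|^2.$$

If

$$K(t) := \sum_{x \in R} \alpha_0(x)e^{(L)}(tx)$$

for $t \in L$ then

$$\sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \alpha_0(x) = q^{-2} \sum_{b \ (\mathrm{mod}\ q)} e_q^{(L)}(-by)K(b/q),$$

and we deduce that

$$S_1(Q_0) = \sum_{q \le Q_0} \sum_{b \ (\mathrm{mod}\ q)} |K(b/q)|^2.$$

We now write the fraction b/q in lowest terms as c/h, say. Then

$$S_1(Q_0) = \sum_{h \le Q_0} \ \sideset{}{^*}\sum_{c \ (\mathrm{mod}\ h)} |K(c/h)|^2\, \#\{q \le Q_0 : h \mid q\} \le Q_0 \sum_{h \le Q_0} h^{-1} \ \sideset{}{^*}\sum_{c \ (\mathrm{mod}\ h)} |K(c/h)|^2.$$

The contribution from terms h > Q is at most

$$Q_0Q^{-1} \sum_{h \le Q_0} \ \sideset{}{^*}\sum_{c \ (\mathrm{mod}\ h)} |K(c/h)|^2 \le Q_0Q^{-1}(\sqrt{2N} + Q_0)^4 \sum_{x \in R} |\alpha_0(x)|^2$$

by the two dimensional large sieve in the form (4.14).

For the remaining terms with $h \le Q$ we observe that

$$\sideset{}{^*}\sum_{c \ (\mathrm{mod}\ h)} |K(c/h)|^2 \le \sum_{c \ (\mathrm{mod}\ h)} |K(c/h)|^2 = h^2 \sum_{y \ (\mathrm{mod}\ h)} |\Sigma(R; h, y)|^2 \le h^4W_0^2$$

by our assumption (6.1). Thus terms with $h \le Q$ contribute at most $Q_0Q^4W_0^2$ to $S_1(Q_0)$.
This enables us to conclude as follows.

Lemma 12. — Under the assumption (6.1) we have

$$\sum_{q \le Q_0} q^2 \sum_{y \ (\mathrm{mod}\ q)} \Big| \sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \alpha_0(x) \Big|^2 \ll Q_0Q^4W_0^2 + \frac{Q_0(N^2 + Q_0^4)}{Q} \sum_{x \in R} |\alpha_0(x)|^2.$$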

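We will require a form of this estimate in which we have a maximum over different squares
R. We assume that $\alpha_0(x)$ is supported on a set

$$S = \{a + b\tau \in \mathfrak{o}_L : (a,b) \in (-N, N]^2\}$$

and proceed to cover this with smaller squares, each contained in S and of the type

$$R_i = \{a + b\tau \in \mathfrak{o}_L : (a - a_i, b - b_i) \in (-N/K, N/K]^2\},$$

for appropriate pairs $(a_i, b_i)$. Here $K \le N$ is a positive integer parameter which we will
specify in due course. Now any square $R \subseteq S$, with sides aligned with the coordinate axes,
will include a union of certain of the squares $R_i$, outside which there are only $O(N^2K^{-1})$
points of $\mathfrak{o}_L$. If we now require that

$$|\alpha_0(x)| \le A_0 \tag{6.2}$$

for all x, then

$$|\Sigma(R; q, y)| \le O(N^2K^{-1}A_0) + \sum_{i \le K^2} |\Sigma(R_i; q, y)|.$$

Thus if

$$S_2(Q_0) := \sum_{q \le Q_0} q^2 \sum_{y \ (\mathrm{mod}\ q)} \max_R |\Sigma(R; q, y)|^2$$

then

$$S_2(Q_0) \ll N^4K^{-2}A_0^2Q_0^5 + K^2 \sum_{i \le K^2} \sum_{q \le Q_0} q^2 \sum_{y \ (\mathrm{mod}\ q)} |\Sigma(R_i; q, y)|^2.$$

Lemma 12 then yields

$$\sum_{q \le Q_0} q^2 \sum_{y \ (\mathrm{mod}\ q)} |\Sigma(R_i; q, y)|^2 \ll Q_0Q^4W_0^2 + \frac{Q_0(N^2K^{-2} + Q_0^4)}{Q} \sum_{x \in R_i} |\alpha_0(x)|^2.$$

Since

$$\sum_{i \le K^2} \sum_{x \in R_i} |\alpha_0(x)|^2 \le \sum_{x \in S} |\alpha_0(x)|^2$$

we deduce that

$$S_2(Q_0) \ll N^4K^{-2}A_0^2Q_0^5 + K^4Q_0Q^4W_0^2 + \frac{Q_0(N^2 + K^2Q_0^4)}{Q} \sum_{x \in S} |\alpha_0(x)|^2.$$

We now choose $K = A_0Q_0^3$, which yields the following conclusion.
Lemma 13. — Under the assumptions (6.1) and (6.2) we have

$$\sum_{q \le Q_0} q^2 \sum_{y \ (\mathrm{mod}\ q)} \max_R \Big| \sum_{\substack{x \in R \\ x \equiv y \ (\mathrm{mod}\ q)}} \alpha_0(x) \Big|^2 \ll N^4Q_0^{-1} + A_0^4Q_0^{13}Q^4W_0^2 + Q_0Q^{-1}N^2 \sum_{x} |\alpha_0(x)|^2,$$

providing that $A_0Q_0^5 \le N$.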

Wc apply this last result to our particular situation. In view of the conditions in Lemmas 10 
and 11 we take AT of order U""/"^, Q^+s ^ C/^-" and G < C/V{"+i), so that the value Wq = 
Qn+ijjn-i+rj -g admissible. In order to estimate ao(x) wc note that a{x) <Cr) for any 
?7 > 0. Moreover (4.13) yields |a(x)| ^ Q^uj{x), since (4.9) and (4.10) imply 



^ p{z,q) = l. 

z (modq) 



(6.3) 



Since uj{x) <C 1 by Lemma 9, we will certainly have aQ{x) Q^U^. Taking Aq Q^W^, 
the right hand side in Lemma 13 is therefore 

«r, + Ql3Q2n+18^2n-2+0(^) ^ Q^^Q-lu^n+nQOil) ^ 

by Lemma 11. This enables us to conclude as follows. 
Lemma I4. — IfQ^Qo^ u^/i^+^e) q ^ [/i/(n+i)^ ^/^^^ 



g^Qo y (mod q) 



max 
R 



^ ao(a 



x=y (mod g) 



for any rj > 0. 



We end by establishing a trivial bound for the above sum, which provides an instructive 
comparison. If g ^ Qo ^ [/"/^ then 



^ ao(a 



x=y (mod g) 



<C #{x E R : X = y (modg)} |ao(a 



x=y (mod g) 



a;=j/ (mod q) 
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whence 



E 

y (mod q) 



max 
R 



E ao{x) 



x=y (mod q) 



2/ (mod q) x=y (mod g) 
xEOl 



We therefore obtain the trivial bound 



q^Qo y (mod g) 



max 



E "o(a 



x=2/ (mod g) 



(6.4) 



via Lemma 11. Thus Lemma 14 provides a saving which is a power of Q, providing that Q is 
larger than a suitable power of U^G. 



7. Bilinear forms in dimension 2 

The estimation of bilinear forms is one of the cornerstones of analytic number theory and 
can be traced back to the work of Vinogradov. Given finite sequences $u_m, v_n \in \mathbb{C}$ and a
matrix $A = (a_{m,n})$ of complex numbers, the essential problem is to estimate the double sum

$$u^T A v = \sum_m \sum_n u_m v_n a_{m,n}.$$



The standard procedure is to use Cauchy's inequality to remove the dependence on $u = (u_m)$.
This gives

$$|u^T A v| \le \Big(\sum_m |u_m|^2\Big)^{1/2} \Big(\sum_m \Big|\sum_n v_n a_{m,n}\Big|^2\Big)^{1/2}.$$

For the second term on the right one expands the square and reverses the order of summation,
so as to use suitable information about the sum

$$\sum_m a_{m,n_1} \overline{a_{m,n_2}}.$$

It may happen that the sum itself is small. In other cases one can give an asymptotic
evaluation with a main term $M(n_1, n_2)$ say. One may then complete the analysis via an
estimation of the sum $\sum_{n_1} \sum_{n_2} v_{n_1} \overline{v_{n_2}} M(n_1, n_2)$.
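The first step of this procedure, the Cauchy inequality that removes the dependence on $u$, can be checked numerically. The following is a minimal sketch on random complex data; the helper names `bilinear` and `cauchy_bound` and the matrix sizes are illustrative choices, not taken from the paper.

```python
import math
import random


def bilinear(u, A, v):
    """Return the double sum u^T A v = sum_m sum_n u_m v_n a_{m,n}."""
    return sum(u[m] * A[m][n] * v[n]
               for m in range(len(u)) for n in range(len(v)))


def cauchy_bound(u, A, v):
    """Return ||u||_2 * (sum_m |sum_n v_n a_{m,n}|^2)^(1/2)."""
    norm_u = math.sqrt(sum(abs(x) ** 2 for x in u))
    inner = sum(abs(sum(v[n] * A[m][n] for n in range(len(v)))) ** 2
                for m in range(len(u)))
    return norm_u * math.sqrt(inner)


def random_complex():
    return complex(random.uniform(-1, 1), random.uniform(-1, 1))


random.seed(0)  # fixed seed so that the sketch is reproducible
u = [random_complex() for _ in range(5)]
v = [random_complex() for _ in range(7)]
A = [[random_complex() for _ in range(7)] for _ in range(5)]

lhs = abs(bilinear(u, A, v))
rhs = cauchy_bound(u, A, v)
print(lhs <= rhs)
```

The bound only uses the $L^2$-norm of $u$, which is why the remaining work in the procedure concerns the second factor alone.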

In our work we will require an analogue of this procedure for sequences indexed by elements
of $\mathbb{Z}^2$, rather than by $\mathbb{Z}$. In this endeavour we are inspired by the analytic machinery developed
for the Gaussian integers by Fouvry and Iwaniec [12, § 9]. We will provide a completely
self-contained account of the method, which has the advantage of being slightly more general
in scope.

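Let $M \in \mathrm{SL}_2(\mathbb{Z})$ and let $\alpha, \beta : \mathbb{Z}^2 \to \mathbb{C}$ be arbitrary functions with finite $L^2$-norms

$$\|\alpha\|_2 := \Big(\sum_{\mathbf{x} \in \mathbb{Z}^2} |\alpha(\mathbf{x})|^2\Big)^{1/2}, \qquad \|\beta\|_2 := \Big(\sum_{\mathbf{y} \in \mathbb{Z}^2} |\beta(\mathbf{y})|^2\Big)^{1/2}.$$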
Furthermore, let $\lambda : \mathbb{Z} \to \mathbb{C}$ be such that

$$\|\lambda\|_2 := \Big(\sum_{l \in \mathbb{Z}} |\lambda(l)|^2\Big)^{1/2}$$

is also finite. Our objective is to estimate the double sum

$$S(\alpha, \beta; \lambda) := \sum_{\mathbf{x} \in \mathbb{Z}^2} {\sum_{\mathbf{y} \in \mathbb{Z}^2}}^{\dagger} \alpha(\mathbf{x}) \beta(\mathbf{y}) \lambda(\mathbf{x}^T M \mathbf{y}),$$

where $\dagger$ indicates that the summation over $\mathbf{y}$ is restricted by the condition $\gcd(y_1, y_2) \le E$.
In the analysis of this section the implied constants in our $O(\cdot)$ and $\ll$ notation will be
allowed to depend implicitly on the coefficients of the matrix $M$. However they will be uniform
in all other parameters. It will be convenient to introduce a norm on vectors $\mathbf{x} = (x_1, x_2)$ via
$|\mathbf{x}| = \max\{|x_1|, |x_2|\}$. We should observe that this particular choice of norm is important,
since it will be significant for us that the unit ball is a square.

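Let $A, B \ge 1$ with $A \ge B$. We will think of $A$ and $B$ as being large, with $A/B$ also large,
but of considerably smaller order than $B$. We will suppose that $\alpha, \beta$ are supported on the
sets

$$\mathcal{A} = \{\mathbf{x} \in \mathbb{Z}^2 : |\mathbf{x}| \le A\}, \qquad \mathcal{B} = \{\mathbf{y} \in \mathbb{Z}^2 : B \le |\mathbf{y}| \le 2B\}, \qquad (7.1)$$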
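respectively. Let $M_0$ denote the maximum of the moduli of the entries of $M$, and let

$$\widehat{M} = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}^{-1} M.$$

We now define

$$\widehat{\mathcal{B}} = \{\mathbf{z} \in \mathbb{Z}^2 : (2M_0)^{-1} B \le |\mathbf{z}| \le 4M_0 B\}$$

and we observe that $\widehat{M}\mathbf{y} \in \widehat{\mathcal{B}}$ whenever $\mathbf{y} \in \mathcal{B}$.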

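An application of Cauchy's inequality now gives

$$|S(\alpha, \beta; \lambda)|^2 \le \|\beta\|_2^2 \, \|\lambda\|_2^2 \, {\sum_{\mathbf{y} \in \mathcal{B}}}^{\dagger} \sum_{l \in \mathbb{Z}} \Big| \sum_{\substack{\mathbf{x} \in \mathbb{Z}^2 \\ \mathbf{x}^T M \mathbf{y} = l}} \alpha(\mathbf{x}) \Big|^2.$$

Enlarging the range of summation for $\mathbf{y}$ we therefore deduce that

$$|S(\alpha, \beta; \lambda)| \le \|\beta\|_2 \cdot \|\lambda\|_2 \cdot T(\alpha)^{1/2}, \qquad (7.2)$$

where

$$T(\alpha) := {\sum_{\widehat{M}\mathbf{y} \in \widehat{\mathcal{B}}}}^{\dagger} \sum_{l \in \mathbb{Z}} \Big| \sum_{\substack{\mathbf{x} \in \mathbb{Z}^2 \\ \mathbf{x}^T M \mathbf{y} = l}} \alpha(\mathbf{x}) \Big|^2.$$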

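Opening up the inner sum we obtain

$$T(\alpha) = {\sum_{\widehat{M}\mathbf{y} \in \widehat{\mathcal{B}}}}^{\dagger} \sum_{\substack{\mathbf{w} \in \mathbb{Z}^2 \\ \mathbf{w}^T M \mathbf{y} = 0}} (\alpha * \overline{\alpha})(\mathbf{w}),$$

with

$$(\alpha * \overline{\alpha})(\mathbf{w}) := \sum_{\substack{\mathbf{x}, \mathbf{x}' \in \mathbb{Z}^2 \\ \mathbf{x} - \mathbf{x}' = \mathbf{w}}} \alpha(\mathbf{x}) \overline{\alpha(\mathbf{x}')}.$$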
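The vectors $\mathbf{y}$ take the form $e\mathbf{u}$ where $\mathbf{u} = (u_1, u_2) \in \mathbb{Z}^2$ is primitive, and where $e \le E$ is
a positive integer. Since $\det M = 1$, we see that the general solution of the linear equation
$\mathbf{w}^T M \mathbf{u} = 0$ is

$$\mathbf{w} = c\widehat{M}\mathbf{u}$$

for $c \in \mathbb{Z}$. Hence

$$T(\alpha) = \sum_{e \le E} {\sum_{e\widehat{M}\mathbf{u} \in \widehat{\mathcal{B}}}}^{*} \sum_{c \in \mathbb{Z}} (\alpha * \overline{\alpha})(c\widehat{M}\mathbf{u}) = \sum_{e \le E} {\sum_{e\mathbf{z} \in \widehat{\mathcal{B}}}}^{*} \sum_{c \in \mathbb{Z}} (\alpha * \overline{\alpha})(c\mathbf{z}),$$

where $\sum^{*}$ denotes summation for primitive vectors $\mathbf{z} = (z_1, z_2)$. We note that Cauchy's
inequality yields $|(\alpha * \overline{\alpha})(\mathbf{w})| \le \|\alpha\|_2^2$ for any $\mathbf{w} \in \mathbb{Z}^2$. Thus the contribution to the above
sum from terms with $c = 0$ is $O(\|\alpha\|_2^2 B^2)$, whence

$$T(\alpha) = T_0(\alpha) + O(\|\alpha\|_2^2 B^2), \qquad (7.3)$$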

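with

$$T_0(\alpha) := \sum_{e \le E} {\sum_{e\mathbf{z} \in \widehat{\mathcal{B}}}}^{*} \sum_{c \in \mathbb{Z} \setminus \{0\}} (\alpha * \overline{\alpha})(c\mathbf{z}).$$

We now use the Möbius function to pick out the condition $\gcd(z_1, z_2) = 1$, giving

$$T_0(\alpha) = \sum_{e \le E} \sum_{c \in \mathbb{Z} \setminus \{0\}} \sum_{b=1}^{\infty} \mu(b) T_0(\alpha; b, c, e),$$

where

$$T_0(\alpha; b, c, e) := \sum_{\substack{\mathbf{w} \in \mathbb{Z}^2 \\ ec^{-1}\mathbf{w} \in \widehat{\mathcal{B}} \\ bc \mid \mathbf{w}}} (\alpha * \overline{\alpha})(\mathbf{w}).$$

For $ec^{-1}\mathbf{w} \in \widehat{\mathcal{B}}$ we deduce that

$$(2M_0)^{-1} e^{-1} |c| B \le |\mathbf{w}| \le 4M_0 e^{-1} |c| B. \qquad (7.4)$$

Moreover we will have $(\alpha * \overline{\alpha})(\mathbf{w}) = 0$ unless $|\mathbf{w}| \le 2A$. Thus we can restrict the summation
over $c$ to the range

$$0 < |c| \le 4M_0 A B^{-1} E = C,$$

say.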
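Two elementary facts were used in the preceding steps: the integer solutions of $\mathbf{w}^T M \mathbf{u} = 0$ for primitive $\mathbf{u}$ form a rank one lattice, and the Möbius function detects coprimality. Both can be confirmed by brute force. The sketch below assumes that the generator of the solution lattice is the quarter-turn of $M\mathbf{u}$ (the matrix display defining it is damaged in this copy, so this is a reconstruction), and the sample matrix is a hypothetical element of $\mathrm{SL}_2(\mathbb{Z})$.

```python
from math import gcd


def mobius(n):
    """Moebius function of a positive integer, by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # square factor
            result = -result
        p += 1
    return -result if n > 1 else result


def primitive_indicator(z1, z2):
    """Sum of mu(b) over b dividing gcd(z1, z2); (z1, z2) must be non-zero."""
    g = gcd(z1, z2)
    return sum(mobius(b) for b in range(1, g + 1) if g % b == 0)


def mat_vec(M, u):
    return (M[0][0] * u[0] + M[0][1] * u[1], M[1][0] * u[0] + M[1][1] * u[1])


def quarter_turn(z):
    """Apply the inverse of [[0, 1], [-1, 0]] to z (assumed form of the generator)."""
    return (-z[1], z[0])


def is_integer_multiple(w, d):
    """True when w = c * d for some integer c, with d a non-zero vector."""
    k = 0 if d[0] != 0 else 1
    if w[k] % d[k] != 0:
        return False
    c = w[k] // d[k]
    return (c * d[0], c * d[1]) == w


def solutions_match(M, bound):
    """Compare {w : w^T M u = 0} with the multiples of quarter_turn(M u)."""
    for u1 in range(-bound, bound + 1):
        for u2 in range(-bound, bound + 1):
            if gcd(u1, u2) != 1:
                continue  # only primitive u are considered
            z = mat_vec(M, (u1, u2))
            d = quarter_turn(z)
            for w1 in range(-3 * bound, 3 * bound + 1):
                for w2 in range(-3 * bound, 3 * bound + 1):
                    on_line = (w1 * z[0] + w2 * z[1] == 0)
                    if on_line != is_integer_multiple((w1, w2), d):
                        return False
    return True


M_sample = [[2, 1], [1, 1]]  # hypothetical example with determinant 1
print(solutions_match(M_sample, 4))
```

Since $M$ is unimodular, $M\mathbf{u}$ is primitive whenever $\mathbf{u}$ is, which is what makes every solution an integer multiple of the generator.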

We may now handle certain ranges of $b$ and $c$ trivially. To this end we give ourselves a
parameter $K \ge 1$ which we will choose in due course. In view of (7.4) the number of available
vectors $\mathbf{w} \equiv \mathbf{0} \pmod{bc}$, corresponding to a particular triple $b, c, e$, is $O(B^2 b^{-2} e^{-2})$. Since
$|(\alpha * \overline{\alpha})(\mathbf{w})| \le \|\alpha\|_2^2$ we deduce that the contribution to $T_0(\alpha)$ from $b > K$ is

$$\ll \sum_{e \le E} \sum_{|c| \le C} \sum_{b > K} \|\alpha\|_2^2 B^2 b^{-2} e^{-2} \ll \|\alpha\|_2^2 ABEK^{-1}.$$

Similarly the contribution from $|c| \le CK^{-1}$ is

$$\ll \sum_{e \le E} \sum_{|c| \le C/K} \sum_{b} \|\alpha\|_2^2 B^2 b^{-2} e^{-2} \ll \|\alpha\|_2^2 ABEK^{-1}.$$

It follows that

$$T_0(\alpha) = \sum_{e \le E} \sum_{b \le K} \mu(b) \sum_{C/K < |c| \le C} T_0(\alpha; b, c, e) + O(\|\alpha\|_2^2 ABEK^{-1}).$$

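Combining this with (7.2) and (7.3) therefore leads to the conclusion that

$$S(\alpha, \beta; \lambda) \ll \|\beta\|_2 \cdot \|\lambda\|_2 \left\{ \|\alpha\|_2 B + \|\alpha\|_2 (ABEK^{-1})^{1/2} + T_1(\alpha)^{1/2} \right\}$$

for any $E \ge 1$ and $K \ge 1$, with

$$T_1(\alpha) := \sum_{e \le E} \sum_{b \le K} \sum_{C/K < |c| \le C} |T_0(\alpha; b, c, e)|.$$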

We now open up the convolution a * a to obtain 

ro(a;6,c,e) = ^ a(x') ^ a(x), 



xsx' (mod be) 



where 



Thus 



= {x G : (2Mo)"^e"^cS ^ |x - x'| ^ AMoe'^cB}. 



|ro(Q;; 5, c, e)| ^ |a(x')|max 



E «(^^ 



xe/J 

x=x' (mod 6c) 



where R now runs over all squares with sides aligned to the axes. It follows from Cauchy's 
inequality that 

2 



|ro(Q;; fe, c, e)p ^ HctHi ™^ 



E «w 



x'e.(/ 

< ||a||^(l + ^'(6c)-')r2(a;|6c|), 



xei? 

x=x' (mod fee) 
|2/i , /l2/l,„^-2^ 



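with

$$T_2(\alpha; q) := \sum_{\mathbf{u} \pmod{q}} \max_{R} \Big| \sum_{\substack{\mathbf{x} \in R \\ \mathbf{x} \equiv \mathbf{u} \pmod{q}}} \alpha(\mathbf{x}) \Big|^2. \qquad (7.5)$$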

If we insist that E ^B then 1 + ^|6c|"^ ^ 1 + AKC~^ < BKE'^, whence 



Ti(a)<^^ ^ ||a||25ifE-ir2(a;|6c|)V2. 

fe^if C/K<c^C 

Hence, if r(g) denotes the usual divisor function, and we set 

q^CK 



(7.6) 
a further application of Cauchy's inequality yields

\clK<q<,CK J 

« BK\\ah {KC-\\ogCf]^''' tI''' 
<,(BKfl''{^AE)-^l\\o^Afl''\\a\\2Tll\ 
We may now conclude as follows. 

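Lemma 15. Let $A \ge B \ge E \ge 1$ and define $C = 4M_0AB^{-1}E$. Let $\alpha, \beta : \mathbb{Z}^2 \to \mathbb{C}$ be
functions supported on the sets (7.1), and let $\lambda : \mathbb{Z} \to \mathbb{C}$. Define $T_2(\alpha; q)$ as in (7.5), together
with the quantity introduced in (7.6). Then for any $K \ge 1$ we have

$$S(\alpha, \beta; \lambda) \ll \|\beta\|_2 \cdot \|\lambda\|_2 \left\{ \|\alpha\|_2 B + \|\alpha\|_2 (ABEK^{-1})^{1/2} + T_1(\alpha)^{1/2} \right\},$$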

with 

Ti{a) < iBKf/''iAE)-^/\logAf\\a\\2T^/\ 

Before proceeding further it may be helpful to comment on the above estimate. For the
purposes of this illustration we shall suppose that K is chosen so that 

max{^^ log^ A} K ^ A/B. (7.7) 

If we merely estimate $S(\alpha, \beta; \lambda)$ via Cauchy's inequality, using the fact that $\mathbf{x}^T M \mathbf{y} = l$ has
$O(AB \log A)$ solutions $\mathbf{x}, \mathbf{y}$ for each given $l$, we are led to a trivial bound

$$S(\alpha, \beta; \lambda) \ll \|\alpha\|_2 \cdot \|\beta\|_2 \cdot \|\lambda\|_2 \sqrt{AB \log A}.$$

Hence the first term in the above lemma saves at least $\sqrt{A/B}$. Similarly the second term
saves at least $\sqrt{K/E}$. Both these are at least $K^{1/4}$. To analyse the third term we use the
argument in (6.4) to deduce that

T2{a-q) < A2g-2||a||i 
for q<^ A. We therefore obtain the trivial bound 

ri(a) « ||a||iAV2s=^/2cV2^2^-i/2iQg2^ 

< \\a\\lABK'^\og^ A 

For comparison, in order for the third term in Lemma 15 to produce a comparable saving to 
that in the first two terms, one would wish to replace the above by 

Ti{a) < \\a\\lABK-'^, 

say. The type of saving we require is exactly that given by Lemma 14, providing that we 
work with S{aQ, /3; A). 

We now show how Lemma 15 can be applied to estimate 

^{G,H,V)= ^ ^«o(x)/3(y)A(Tr(x,y)), 
when $G = \log V$. We have yet to specify the parameter $Q$ used in the definition of $\hat{\alpha}$, but
we shall do this shortly. Under suitable hypotheses we will see in § 8 and § 9 that the main 
term ^(G, H, V) has order at least (log y)i-3ni7ny2n Qur goal is to show that S{G, H, V) 
is smaller than this by at least a power of H. 

If $x = x_1 + x_2\tau$ and $y = y_1 + y_2\tau$, then $\mathrm{Tr}(x, y) = \mathbf{x}^T M \mathbf{y}$, where



M 



-1 

1 



By abuse of notation we will write ao(x) = ao{x) and /3(y) = l3{y). From our definitions of 
a, a and w in (3.15), (4.13) and (5.8) we see that if ao{x) / then x = 6Nj^/i{u) for some 
u G Moreover, from (3.14) we deduce that SN^/lH = W'/'^SNK/Li^'^^^) + o(C/"/2) 
as G — >■ GO. Since 5N;^/i(u('^^) 7^ it follows that a(x) is supported on a suitable set 
|x| ^ A with A = ci?7"/^ for a certain constant ci > 0. We may analyse the support of /3 
similarly. Since N/^/^(v''^)) 7^ we deduce that /3(y) is supported on a set i? ^ |y| ^ 2B 
with B = c^y^l"^ for a suitable positive constant ci- Moreover, ,5(y) is also supported on 
vectors y with gcd(yi, 2/2) ^ 1, by (3.13). We may therefore take E of order 1. 

Estimates for ||/3||2 and ||A||2 are given by Lemma 4, while Lemma 11 handles ||ao||2- 
Inserting these bounds into Lemma 15 and adopting the assumption (7.7), together with the 
bounds g"+5 ^ C/i-^' and G ^ \J'^I^'^+^) ^ we then see that 

B + {ABEK-^fl'^ < [^n/4-j^n/4^-l/2 

and 

{BKfl'^{AE)-^l'^{}.OgAf < [/-V4y3n/4^5/2^ 

so that 

S'{G,H,V) <^ y("'+'?)/2;4^("'+'?)/2^^(n+??)/2 _ ^n/4yn/4^-l/2 
_^ jj-n/8y3n/8j^5/4 . [/{n+7?)/4^1/4^ ^O(l) _ 



We now apply Lemma 14, along with the bounds C <C AB ^E and G = 0^(y ), to deduce 
that 

q^CK 

providing that 

Q^GK<?7^/("+i6). (7.8) 

Note that when G = logV the condition G ^ [7^/("+i) is automatic for large V. Moreover, if 
Q ^ {7i/("+i6) ^j^gQ automatically have Q^~^^ ^ U^~^, if 7] is small enough. 
Thus, on recalling that U = HV and W = H^/'^V, we deduce that 

<f (G, H, V) <^ i/«/4yn+0(r,) l^jj3n/iynj^-l/2 ^ ^3n/4-^n^3/2g-l/4 ^ 
= iJ"y^"+0('?)(^-l/2 _^ iv:3/2Q-l/4^_ 

It is now apparent that we should take K = Q^/^, which will satisfy (7.7) and (7.8) if 
Q < H'^''/'^ and (logy)^^ < Q < ^-4n[j8/(n+i6) ^g conclude as follows. 
Lemma 16. — Let G = log V and assume that
Then we have 



8. Estimation of the main term 

The purpose of this section and the next is to produce a satisfactory estimate for the sum 

^ = ^{G,H,V)= ^a(x)/3(y)A(Tr(x,y)), 

as $G \to \infty$, where $\hat{\alpha}$ is the approximation to $\alpha$ which we constructed in (4.13). Our goal
will be to demonstrate that this sum has order $G^{-2n}H^nV^{2n}$, which simple heuristics suggest to
be its expected size. In fact we will fall a little short of this, showing that
it is $\gg G^{1-3n}H^nV^{2n}$ when suitable constraints are placed on the parameters $Q$, $G$, $H$ and $V$.
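Let

$$\mathcal{V}_1 := \{\mathbf{v} \in \mathcal{V} \cap \mathbb{Z}^n : \mathbf{v} \equiv \mathbf{v}^{(M)} \pmod{M},\ \text{(3.12) holds}\},$$
$$\mathcal{W}_1 := \{\mathbf{w} \in \mathcal{W} \cap \mathbb{Z}^n : \mathbf{w} \equiv \mathbf{w}^{(M)} \pmod{M}\}, \qquad (8.1)$$

where $\mathcal{V}, \mathcal{W}$ are given by (3.14). Opening up the functions $\beta$ and $\lambda$ from (3.16) and (3.17),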
respectively, we find that 

= E E E "(^) = E E -^(v, w), 

Tri/Q(a:Nj^/i(v))=2Njf/Q(w) 

say. Recall that $D_L = (\tau - \tau^{\sigma})^2$ where $\{1, \tau\}$ is a $\mathbb{Z}$-basis for $\mathfrak{o}_L$. It will be convenient to make
the observation that

$$2\,\mathrm{Tr}_{L/\mathbb{Q}}(\tau^2) - \mathrm{Tr}_{L/\mathbb{Q}}(\tau)^2 = D_L,$$

which is a non-zero integer. By enlarging the weak approximation set $S$ in Theorem 2, if
necessary, we may clearly assume that $M$ contains any prime divisors of $2D_L$.
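The trace identity above is easy to confirm on examples. The sketch below assumes that $\tau$ is a root of a monic integer quadratic $x^2 - sx + n$ (the parameters `s` and `n` are introduced here for illustration), so that $\mathrm{Tr}(\tau) = s$ and $\mathrm{Tr}(\tau^2) = s^2 - 2n$, and compares the left-hand side with the square of the difference of the two conjugates.

```python
import cmath


def trace_side(s, n):
    """Return 2*Tr(tau^2) - Tr(tau)^2 for tau a root of x^2 - s*x + n."""
    tr_tau = s
    tr_tau_sq = s * s - 2 * n  # tau^2 + tau'^2 = (tau + tau')^2 - 2*tau*tau'
    return 2 * tr_tau_sq - tr_tau ** 2


def conjugate_difference_squared(s, n):
    """Return (tau - tau')^2 computed numerically from the two roots."""
    root = cmath.sqrt(s * s - 4 * n)
    tau, tau_conj = (s + root) / 2, (s - root) / 2
    return (tau - tau_conj) ** 2


for s, n in [(1, -1), (0, 1), (1, 1), (0, -2)]:
    exact = trace_side(s, n)
    numeric = conjugate_difference_squared(s, n)
    print(s, n, exact, abs(numeric - exact) < 1e-9)
```

In each case the common value is $s^2 - 4n$, the discriminant of the quadratic.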

We now set in motion our analysis of ^(v, w), for given v G 1^ and w G #i. Suppose 
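that $N_{K/L}(\mathbf{v})$ decomposes as $N_1(\mathbf{v}) + N_2(\mathbf{v})\tau$, for suitable forms $N_1, N_2$ of degree $n/2$. Then
(3.12) demands that $N_1(\mathbf{v})$ and $N_2(\mathbf{v})$ be coprime. Moreover, in view of (3.8) and our
convention that $\omega_1 = 1$ in the integral basis $\{\omega_1, \ldots, \omega_n\}$ for $K$ over $\mathbb{Q}$, it is clear that

$$N_1(\mathbf{v}) \equiv 1 \pmod{M}, \qquad N_2(\mathbf{v}) \equiv 0 \pmod{M}.$$

Let us write $x = x_1 + x_2\tau$. The constraint in the $x$ summation is equivalent to

$$a_1 x_1 + a_2 x_2 = b,$$

for integers $a_1, a_2, b$ such that

$$a_1 = \mathrm{Tr}_{L/\mathbb{Q}}(N_{K/L}(\mathbf{v})) = 2N_1(\mathbf{v}) + \mathrm{Tr}_{L/\mathbb{Q}}(\tau) N_2(\mathbf{v}),$$
$$a_2 = \mathrm{Tr}_{L/\mathbb{Q}}(\tau N_{K/L}(\mathbf{v})) = \mathrm{Tr}_{L/\mathbb{Q}}(\tau) N_1(\mathbf{v}) + \mathrm{Tr}_{L/\mathbb{Q}}(\tau^2) N_2(\mathbf{v}), \qquad (8.2)$$
$$b = 2N_{K/\mathbb{Q}}(\mathbf{w}).$$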
In order to analyse ./#(v, w) we will need to recall the expression for a{x). Let the function
uj{x) = uj{xi,X2) be given by (5.8), with key properties as in Lemma 9. Then for any 
parameter Q ^ 1 and x E Ol we have 

I^Q t(modg) z(modq) 

where p{z, q) is given by (5.6) and the notation ^ * means that the sum is taken over t = 
ti + t2T, with ti,t2 G Z/gZ such that gcd{q,ti,t2) = 1. Define 

cg{t):= J2 p{^.Qyf\tz), (8.3) 

z (mod q) 

for t(modg). It is clear from (6.3) that |cg(t)| ^ 1. Inserting our expression for a.{x) and 
breaking the inner sum into congruence classes modulo g, we see that 

^(v,w) = ^ Y,* E 4^^H0=^.(v,w), 

I^Q t (mod q) r (mod q) 

where 

^,(v,w):= ^(^)- 

aixi+a2X2=b 
x=r (mod q) 

We proceed to investigate the summation conditions on x. We first prove that ai and 02 
are not both zero, that b is non-zero, and that 

In particular gcd(a4,a2) | b. 

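To establish the claim we observe that $\gcd(a_1, a_2)$ is a common divisor of

$$2a_2 - \mathrm{Tr}_{L/\mathbb{Q}}(\tau) a_1 = D_L N_2(\mathbf{v})$$

and

$$\mathrm{Tr}_{L/\mathbb{Q}}(\tau^2) a_1 - \mathrm{Tr}_{L/\mathbb{Q}}(\tau) a_2 = D_L N_1(\mathbf{v}).$$

However $\mathbf{v}$ and $\mathbf{w}$ are in the regions (3.14) with $N_{K/L}(\mathbf{v}^{(\mathbb{R})})$ and $N_{K/\mathbb{Q}}(\mathbf{w}^{(\mathbb{R})})$ non-zero. Thus
if $G$ is large enough we will have $N_{K/L}(\mathbf{v})$ and $N_{K/\mathbb{Q}}(\mathbf{w})$ non-zero. Hence $b$ will be non-zero
and similarly, since $D_L \ne 0$, the numbers $a_1$ and $a_2$ cannot both be zero. We also see that

$$\gcd(a_1, a_2) \mid D_L \gcd(N_1(\mathbf{v}), N_2(\mathbf{v})) = D_L.$$

Since $N_1(\mathbf{v}) \equiv 1 \pmod{p}$ and $N_2(\mathbf{v}) \equiv 0 \pmod{p}$ for any prime divisor $p$ of $2D_L$, the claim
readily follows.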

We now write $a_i' = 2^{-\kappa} a_i$ for $i = 1, 2$, and $b' = 2^{-\kappa} b$, so that the first condition on $\mathbf{x}$
becomes $a_1' x_1 + a_2' x_2 = b'$. Thus the sum will be empty unless $a_1' r_1 + a_2' r_2 \equiv b' \pmod{q}$, as we
now assume. For future use we note that this is equivalent to demanding that

$$2^{-\kappa}\, \mathrm{Tr}_{L/\mathbb{Q}}(r N_{K/L}(\mathbf{v})) \equiv 2^{1-\kappa} N_{K/\mathbb{Q}}(\mathbf{w}) \pmod{q}. \qquad (8.5)$$

We may therefore write $a_1' r_1 + a_2' r_2 = b' + qk$ for some $k \in \mathbb{Z}$. Then, if we take $\mathbf{x} = \mathbf{r} + q\mathbf{y}$,
the summation conditions on $\mathbf{x}$ translate into the requirement that $a_1' y_1 + a_2' y_2 = -k$. Since
$a_1'$ is coprime to $a_2'$ we can find integers $\bar{a}_1, \bar{a}_2$ such that $a_1' \bar{a}_1 + a_2' \bar{a}_2 = 1$. We then have
$a_1'(y_1 + k\bar{a}_1) + a_2'(y_2 + k\bar{a}_2) = 0$, so that $y_1 + k\bar{a}_1 = -m a_2'$ and $y_2 + k\bar{a}_2 = m a_1'$ for some integer
$m$. It follows that the summation conditions $a_1 x_1 + a_2 x_2 = b$ and $\mathbf{x} \equiv \mathbf{r} \pmod{q}$ are satisfied
if and only if $x_1$ and $x_2$ take the forms $x_1 = r_1 - \bar{a}_1 kq - a_2' qm$ and $x_2 = r_2 - \bar{a}_2 kq + a_1' qm$
respectively. Our conclusion is therefore that

if,(v,w) = ^/(m), 

if (8.5) holds, where 

$$f(m) := \omega(r_1 - \bar{a}_1 kq - a_2' qm,\ r_2 - \bar{a}_2 kq + a_1' qm).$$
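The parametrisation by $m$ of the points on the line that lie in a fixed residue class can be tested directly. The following sketch assumes coprime coefficients (written `a1`, `a2` for the primed quantities) and uses the extended Euclidean algorithm to produce the auxiliary integers; the function names and the numerical example are hypothetical.

```python
def extended_gcd(a, b):
    """Return (g, s, t) with a*s + b*t = g, where |g| = gcd(a, b)."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        quotient = old_r // r
        old_r, r = r, old_r - quotient * r
        old_s, s = s, old_s - quotient * s
        old_t, t = t, old_t - quotient * t
    return old_r, old_s, old_t


def parametrised_solutions(a1, a2, b, r1, r2, q, m_values):
    """List the points x = (x1, x2) with a1*x1 + a2*x2 = b and x = r (mod q)."""
    g, abar1, abar2 = extended_gcd(a1, a2)
    if g < 0:  # normalise so that a1*abar1 + a2*abar2 = 1
        g, abar1, abar2 = -g, -abar1, -abar2
    if g != 1:
        raise ValueError("a1 and a2 must be coprime")
    excess = a1 * r1 + a2 * r2 - b
    if excess % q != 0:
        return []  # the residue class contains no solutions
    k = excess // q
    return [(r1 - abar1 * k * q - a2 * q * m, r2 - abar2 * k * q + a1 * q * m)
            for m in m_values]


points = parametrised_solutions(3, 5, 7, 0, 3, 4, range(-10, 11))
print(all(3 * x1 + 5 * x2 == 7 and x1 % 4 == 0 and x2 % 4 == 3
          for x1, x2 in points))
```

Each value of $m$ gives exactly one point, which is why the sum over $\mathbf{x}$ becomes a one-dimensional sum over $m$.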

We would now like to replace the discrete summation over $m$ by a continuous integral. For
this a relatively crude approach is available to us through Lemma 9. Thus if $\nu \in [0, 1]$ it
follows that

f{m + v) - f{m) < ?7-"/2gmax{|a'i|,|a2|}. 
Moreover Lemma 9 tells us that uj is supported on a disc of radius 0(C/"/^), whence / is 
supported on an interval with length 0(C/"/^/(gmax{|a'^|, |a2l}))- Recalling that U ^ V in 
Lemma 3 and max{|a']^|, \a'2\} <C F"/^, this therefore produces the conclusion 



J —( 



< l + 5?7~"/^max{|ai|,|a2|}<g. 



/(m)dm- ^ /(m) 

Assuming that 02 7^ the change of variables m = q~^{—2'^x + ri/a'2 — alkq/a'2) now yields 

J2;(v,w) = ^/(v,w) + 0(g), 

for r satisfying (8.5), with 

/oo 
u{a2X^—aix + h/a2)dx. (8-6) 
-00 

Here ai, 02, 6 depend on v and w and are given by (8.2). If 02 = we reverse the roles of oi 
and 02 to produce an integral involving uj{—a2X + b/ai, aix). 

Recall the estimate |cq(i)| ^ 1 that we recorded above. Inserting our estimate for ^r(v, w) 
into that for ^(v,w), it now follows that 

^(v,w) = 2''/(v,w)5;i c,(i) Yl ef\-tr) + 0{Q^). 

g^Q t(modg) r(modg) 
(8.5) holds 

We now sum both sides over all v € l^i and w G On observing that i^Yi = 0(G~"y*) 
and = 0((jr~"M^"), we see that the overall contribution from the error term is 

This will be satisfactory if Q is sufficiently small compared to H. Our work so far has shown 
that 

^ = 2« ^ J2 ^(V, W) ^ + O (^Q6^"/2y2n^ ^ ^8 7) 

where are given by (8.1), and 

'^^= ^^(*) E 4^H-*o- 

t (mod q) r (mod q) 

(8.5) holds 
Opening up (8.3), we find that

^= E E p('-''i^ E* i'-Htiz-r)). 

r (mod q) z (mod q) t (mod q) 

(8.5) holds 

We have seen that the condition (8.5) can be written a[ri + a2r2 = b' (modg), for non-zero 
integers a'i,a'2,b' such that gcd(a'^,a2) = 1. The inner sum is a Ramanujan sum and thus it 
follows from (4.12), combined with (4.9), that 

^=^«V(9/u) E E z'^^'^) 

u\q r(modg) z(modg) 

r 1 r2 s6' (mod q) z=r (mod u) 

= ^ii'^li{q/u) ^ p{r,u). 

u\q r(modg) 

a'-^r\+a'2r2=b' (mod q) 

Let q = uv. Writing r = r' + ur" for r' (modtt) and r" (modu) we see that the sum over r is 
equal to 

/?(/, tt)# {r" (modu) : a'^r^ + a2r2 = 5(r') (mod f)} , 

r' (mod u) 
a\r'^-\-a'2T'r2=h' (modn) 

where g(r'^ = u~^{b' — o'^r^ — a2r2). Since a[ and 03 are coprime it follows that there are 
precisely v possibilities for r". The definition (5.6) of p{r',u) therefore reveals that 

T>^n -^ui^iq/u) ( s = u(^-^) (modM), 

^ M,d" 1 L > J/ 2 '^F(v;w;s) = O(mod'u^ 

u\q 

where 

F(v; w;s) := IVi/Q((5N^/i(s)N^/i(v)) - 2N^/q(w). 

This is the polynomial that underpins the variety introduced in (3.5). 

We proceed to insert this expression for into (8.7). We will use the Mobius function to 
remove the coprimality condition (3.12), which is implicit in the definition (8.1) of Yi. We 
therefore arrive at the estimate 



fc=l q<^Q u\q ^ ' ^ s(mod[M,«]) 

ssu(*f) (mod M) 



uniq/u) ^ 

s (mod [M,u]) 

(8.8) 



^ 7(v, w) + O (Q^i?"/^^") , 



(v,w)e-/'2x)ri 

2~'^F(v;w;s)=0 (modn) 

where /(v, w) is given by (8.6) and is defined as for '^1, but with the condition (3.12) 
replaced by k \ N^/^(v). This latter condition is taken componentwise as k \ Ni(v) and 
k I N2(v), in the usual way. 

The sum over k is empty when k ^ y"/^ so that we only need consider values k <^ y^l"^ . 
However we need to reduce this range further. Since N^//,(v('^)) / it follows from (8.2) 
and the definition of 'f that 

max{|ai|,|a2|}» V^"/^. 
The integration in (8.6) is therefore over an interval of length $\ll V^{-n/2} U^{n/2}$, by
Lemma 9. A second application of Lemma 9 to bound the size of $\omega$ now yields

$$I(\mathbf{v}, \mathbf{w}) \ll H^{n/2}. \qquad (8.9)$$

Hence there is an absolute constant c > such that the overall contribution to ^ from terms 
with k > K \s 

K<fc<y"/2 q^Q u\q wg^i 

where 

Mfc(X) := #{v G : |vK X, I N^/i(v)}. 
Clearly fc^ | N;^/q(v) whenever k \ N;^/x,(v), whence 



The overall contribution to the main term from k > K \s therefore seen to be 

«. K • 

This allows us to truncate the summation over k in (8.8) with acceptable error. 

We now replace the sum over s in (8.8) by one in which the variable runs modulo [M, u, k]. 
This has the effect of multiplying it by [M,u,k]~'^\M,u\^ . We may therefore summarise our 
findings in the following result. 

Lemma 17. — Let K ^ 1 and let k be given by (8.4) . Then we have 

^=2«M"EM^)EE^^ E ^m(s) 

k^K q^Q u\q s(modA) 

s=u(*f) (modM) 

where A := [M, u, k] and 

'^k,ui^)'= E ^(v,w). 

(v,w)er2x#i 

2-«F(v;w;s)sO (mod u) 

Here Wi is given by (8.1), I is given by (8.6) and 1^ is defined as for in (8.1), but with 
(3.12) replaced by k \ ^k/l{'^)- 
The next phase of the argument concerns a detailed analysis of J^k,u{^)- It is natural to
break the sum over v and w into congruence classes modulo A = [M, u,k]. Let 

^(A) = ^(p,q,s;A):= ^ ^(^'^)- 

Then we have 



w=q (mod A) vsp (mod A) 



=^m(s) = ^^(A), (8.10) 



p>q 



where the sum is over (p,q) G (Z/AZ)2" for which 

(p,q,s) = (vW,wW,uW)(modM) (8.11) 

and 

2-«F(p;q;s) = O(modtx), k \ ^k/l{v)- (8.12) 

Perhaps the most obvious way to deal with J^(A) is to approximate the sums over w and v 
by integrals. A simple change of variables would then permit us to extract the dependence 
of J^(A) on p, q, s and A. Instead of this it turns out that we can manage with a relatively 
crude direct comparison of =^(A) with =^(1), as we proceed to show. 

It follows from our conditions of summation that A <C A;u ^ KQ. It will be convenient to 
make the additional hypothesis 

KQ ^ V, (8.13) 
which implies in particular that A <C F. In view of (8.6) we have 

/oo 

w=q (mod A) v=p (mod A) 

where if = ai(v) and h = 6(w) are given by (8.2) then 

5(w) 



f{v,-w) := u; a2{v)x,-ai{v)x + 



«2(v). 

We will denote by r(p, q; A) the integrand that appears in this expression for -^^(A). Using 
an approach based on the proof of Lemma 10 in § 5 we compare r(p, q; A) with r(0, 0; A). 
For this we will assume without loss of generality that maxj=i,2 |0'i(v)| = |o.2(v)| in the sum 
over V. The alternative possibility is accommodated by a simple change of variables in the 
integral over x. Hence the definition of Y ensures that 02 (v) has order of magnitude y"/^ 
and by Lemma 9 we have x <^ iJ"/^ in J^(A). 

Under the change of variables w = w' + q we see that 



6(w) 6(w') 



A|w'|"-i Aiy-i 



^ |a2(v)| ^ F"/2 



02lVj a2lvj 

since |w'| <C A + <C by (8.13). We may therefore conclude from Lemma 9 that 

r(p,q;A)= E E 

v=p (mod A) w'sO (mod A) 

Note that the number of v appearing in the outer sum is 0(A~"F"). For w' such that 
w' + q G >^ it is clear that w' e W unless w' is within a distance A of the boundary of 'W. 
Invoking Lemma 9 to deduce that $f$ is bounded absolutely, and recalling that $U = HV$ and
$W = H^{1/2}V$, it easily follows that

A -2n+lT/n/2T^2n-l 

|r(p,q; A) - r(p,0; A)| « {^-'VY {A-^WT'^ + 

^ ^-2n+l^(n-l)/2y2n-l 

by (8.13). 

We now repeat the above process by considering the effect of a change of variables v = v'+p 
in T(p,0; A). Recalling that A < KQ ^ V, by (8.13), we obtain 

|ai(v) - ai(v')| < A|v'r/2-i ^ aW^-^, 

for i = 1, 2. In particular it follows that 

&(w) fe(w) 



02 (v) 02 (v') 



|a2(v)a2(v')| 

A-\/n/2-l^n 



= Aii""/^F"/^~-^, 

for any -w . An application of Lemma 9 reveals that 

AF^/^W^-^ 



r(p,0;A)= E /(v',w) + 



w=0 (mod A) v'sO (mod A) 



Un/2 



To control the error term we note that the total number of available v', w in the sums is 

A" V A 



A — 2n+l i:Tn/2T^3n/2— Ipr^n 
— lTJ/^^^/ A — lT^\n— 1 I 



We conclude that 

|r(p,0;A)-r(0,0;A)| « (A-iW-)"(A-V)-i + ^^^^ 

^ A~2n+l^n/2^2n— 1 

whence 

r(p, q; A) = r(0, 0; A) + O (^/^-■^n+l-^n/2y2n-l^^ 
Summing the latter estimate over p, q modulo A gives 

r(0, 0; 1) = A2"r(0, 0; A) + O (^Aif^/V^"-^) , 
which we substitute back in to get 

r(p, q; A) = A-2"T(0, 0; 1) + O (^/^--^n+l jjn/2y2n-l\^ _ 

Now the outer integral in our expression for ^(A) is over an interval of length 0{H'^^^). We 
have therefore arrived at the following result. 

Lemma 18. — Assume that KQ ^ V . Then we have 

y{A) = A-2"J^(1) + 0(A-2"+lii-"y2n-l)_ 
This result will allow us to separate out what is in effect the "singular integral" associated
to our counting problem. In the notation of (8.6) it is given by 

a^{G,H,V):=J^{l)= J2 E ^(^'^)- (8.14) 

we>rnz" vernz" 

It follows from (8.9) that

$$\sigma_\infty(G, H, V) \ll (G^{-1}V)^n \cdot (G^{-1}W)^n \cdot H^{n/2} = G^{-2n} H^n V^{2n}. \qquad (8.15)$$

It is interesting to compare the present situation with the singular integrals arising from
typical applications of the Hardy-Littlewood circle method. These are expressed as volumes
that reflect the real density of solutions. It transpires that we will be able to provide a
lower bound for $\sigma_\infty(G, H, V)$ which essentially matches the upper bound (8.15) without first
approximating the sum by an integral. Nonetheless crucial use will be made of the fact that
$\sigma_\infty(G, H, V)$ features a sum over points close to a non-singular real point on the variety (3.5).
For given k and u, let 

NMik,u) := #{p,q,s(modA) : (8.11), (8.12) hold}, (8.16) 

where A = [M, u, k]. We claim that 

NM{k,u)^^ , (8.17) 

for any ^ > 0. To see this we write A = Tr^^/Q {SlSi x/ l{p)^ K/ for fixed p, s modulo A. 
Then the number of q(modA) contributing to NM{k,u) is at most 

J2 #{q e [1, A]- : 2N^/Q(q) = N} «5 

7V<A" 
N=A (mod u) 

Moreover there are A" possibilities for s (mod A) and the number of available p (mod A) is 
at most 

#{p G [1, A]" : N^/q(p) = AT} «^ (8.18) 

N=0 (mod 

It therefore follows that 

NM{k, U) <J A" • ,„ = — -2-. 

Noting that uk S> A, we easily arrive at (8.17). 

The extraction of = a^(G, H,V) now follows on combining Lemmas 17 and 18 with 
(8.10) and (8.17). Putting these together, and assuming that KQ ^ V, we therefore deduce 
that 

^ = a^2-M- J2 E E ^^^^^Mik,u) 

k^K q^Q u\q 
where $N_M(k, u)$ is given by (8.16). Since $KQ \le V$ it is clear that the error term is

^6 Q2 



K 



We now show that the summation over k can be extended to infinity with acceptable error. 
It follows from (8.15) and (8.17) that the error in so doing is 

^-2nrrnT.2n -ST- V- « A^^-^+^/S Ql+»?^ny2n 
«, G ^^Hnv^n Y,Y.ll^ 1 «^ ■ 

k>Kq^Q u\q 

This error term is subsumed by that above when KQ ^ V . Choosing 

K = 



Q ' 

the hypothesis (8.13) becomes H^/'^ ^ V. Putting everything together we have therefore 
established the following result. 

Lemma 19. — Assume that iJ"/^ ^ V . Then we have 

^ = 2'^M"cJoo6(Q) + Or, (^g6^'^/2-^2n+2„j ^ 

where a^o = (^oo{G,H,V) is given by (8.14) and 



oo 



-EEM^)E^^^^M(fc'-)' 

q^Q k=l u\q 



with A = [M,u,k] and NM{k,u) given by (8.16). 



9. The singular series and integral 

It remains to produce satisfactory lower bounds for the quantities $\mathfrak{S}(Q)$ and $\sigma_\infty$. For the
former our strategy will be to show that as $Q$ tends to infinity the sum $\mathfrak{S}(Q)$ converges to a
limit $\mathfrak{S}$ which is a product of local factors. We will then show that each of these factors is
positive. In view of (8.17) the sum

fc=l u\q 

is absolutely convergent. We rearrange it as 

oo 

f{q) = J2 MqM E mA-^'^NMik, u) 

u\q k=l 

and write k = kik2 where ki | uM and gcd(A;2, uM) = 1. Then [M, u, k] = [M, u]k2 and 



/{(}) = ^un{q/u) E 




The function NM{kik2,u) factors as 

NM{kik2,u) = NM{ki,u)# {p,ci,s{iaodk2) : k2 \ ^k/l{p)} , 
where A = [M,u] in NM{ki,u). We write

R{k) :=#{p(modfc):A;|N;^/^(p)} 

and observe that 

R{k) and R{k) < A;" for A: > 1, (9.1) 

by the argument used for (8.18), and the fact that N/^ /q(1, 0, 0, . . . , 0) = 1. It follows that 



k2 k2 



with 



=^n (i-^)"^M(fci,.), 

p\uM ^ ^ 

-n(i-f^)>o. M 

We therefore see that f{q) = cfo{q,M) where 

/o(g, M) = Y^ u^{q/u) ^ n fl - ^) u) (9.3) 



u\q k\uM ^ ' p\uM ^ ^ ^ 

is a multiplicative function in the two variables q and M. We write ^ = and M = Y\p^, 
where /x = Vp{M) vanishes for all but finitely many primes. Then 

oo oo 
q=l p a=0 

Providing that this product converges we will be able to deduce that 



exists. Moreover we will have 



with 



lim 6(Q) = 6 



6 = cJJcTp 



<Tp = Y.foip'',Pn, (9-5) 

a=0 

and in order to prove that $\mathfrak{S} > 0$ it will suffice to show that $\sigma_p > 0$ for every prime $p$.

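Our treatment of the singular series will depend on the following two lemmas, which we
will prove later in this section. It will be convenient to write $\mathbf{x}$ for the vector $(\mathbf{p}, \mathbf{q}, \mathbf{s})$ and $\mathbf{x}_0$
for $(\mathbf{v}^{(M)}, \mathbf{w}^{(M)}, \mathbf{u}^{(M)})$. Moreover we shall write $F(\mathbf{x}) = 2^{-\kappa} F(\mathbf{p}; \mathbf{q}; \mathbf{s})$ and $N(\mathbf{x}) = N_{K/L}(\mathbf{p})$.

Lemma 20. — Suppose that $p^{\mu} \| M$. For any $\beta \ge \max\{\mu, 1\}$ define

$$M(p^{\beta}, p^{\mu}) := \#\{\mathbf{x} \pmod{p^{\beta}} : \mathbf{x} \equiv \mathbf{x}_0 \pmod{p^{\mu}},\ p^{\beta} \mid F(\mathbf{x}),\ p \nmid N(\mathbf{x})\}.$$

Then we will have

$$M(p, 1) = p^{3n-1} + O(p^{3n-5/2}). \qquad (9.6)$$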
Lemma 21. — For any prime p and a ^ 2 we have

fo{p",P^) < (2a + lfya-3[a/2]-2_ 

We now prove that the product (9.4) converges. Lemma 21 yields 

oo oo 

Yl iMp'^^pni « + i)^>"-^["/2i-2 « p-\ 

a=2 a=2 

We note also that $\mu$ is non-zero only for the primes in $S$, and that $f_0(1, 1) = 1$. Thus a bound
$f_0(p, 1) \ll p^{-3/2}$ will suffice to establish absolute convergence. In general we have

= E»E|^n (i-^)""p-(*.«) E "bM 

in view of (9.3). The final sum over q vanishes unless u = p^, and if /3 ^ max{/x, 1} our 
expression reduces to 



(9.7) 



j:fo(p''^pn=p-^'''-'^' (i - ^y'j:mNAk,p') 

= p-(3-i)/^ (i-^)''m(/,p'^). 



Taking (5 = 1 and = we obtain 

/0(P, 1) = (l - ^) " {Mb,, 1) -p3»-. (i _ 5M) 

The required estimate $f_0(p, 1) \ll p^{-3/2}$ now follows from Lemma 20, since $R(p)/p^n \ll p^{-3/2}$
by (9.1). This completes the proof of absolute convergence.

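We turn now to the proof that $\sigma_p > 0$ for every prime $p$. By (9.7) we have

$$\sigma_p = \lim_{\beta \to \infty} \sum_{\alpha=0}^{\beta} f_0(p^{\alpha}, p^{\mu}) = \Big(1 - \frac{R(p)}{p^n}\Big)^{-1} \lim_{\beta \to \infty} p^{-(3n-1)\beta} M(p^{\beta}, p^{\mu}).$$

Thus $\sigma_p > 0$ providing that $M(p^{\beta}, p^{\mu}) \gg p^{(3n-1)\beta}$ as $\beta \to \infty$.
Using Hensel's lemma it will follow that $\sigma_p > 0$ if the variety

$$F(\mathbf{v}; \mathbf{w}; \mathbf{u}) = \mathrm{Tr}_{L/\mathbb{Q}}(\delta N_{K/L}(\mathbf{u}) N_{K/L}(\mathbf{v})) - 2N_{K/\mathbb{Q}}(\mathbf{w}) = 0,$$

introduced in (3.5), has a non-singular point over $\mathbb{Z}_p$ which satisfies the constraints

$$p \nmid N_{K/L}(\mathbf{v}) \quad \text{and} \quad (\mathbf{v}, \mathbf{w}, \mathbf{u}) \equiv (\mathbf{v}^{(M)}, \mathbf{w}^{(M)}, \mathbf{u}^{(M)}) \pmod{M}. \qquad (9.8)$$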

The function $M(p, 1)$ counts points over $\mathbb{F}_p$ which lie on the above variety and satisfy the
constraints (9.8), whether they are non-singular or not. The number of singular points will
be $O(p^{3n-6})$, whence the estimate (9.6) shows that there will be a suitable non-singular point
providing that $p \ge p_0$, say. We can arrange that the set $S$ includes all primes $p < p_0$, so that
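Lemma 3 ensures the existence of $p$-adic integer solutions with $\mathbf{v} \equiv (1, 0, \ldots, 0) \pmod{p}$ and
$N_{K/\mathbb{Q}}(\mathbf{w}) \ne 0$. It follows that $p \nmid N_{K/L}(\mathbf{v})$. Finally we prove, by contradiction, that any such
point must be non-singular. For otherwise we would have

$$\nabla_{\mathbf{w}} F(\mathbf{v}; \mathbf{w}; \mathbf{u}) = -2 \nabla_{\mathbf{w}} N_{K/\mathbb{Q}}(\mathbf{w}) = \mathbf{0},$$

where $\nabla_{\mathbf{w}} = (\frac{\partial}{\partial w_1}, \ldots, \frac{\partial}{\partial w_n})$. It would then follow from Euler's identity that

$$\mathbf{w} \cdot \nabla_{\mathbf{w}} N_{K/\mathbb{Q}}(\mathbf{w}) = n N_{K/\mathbb{Q}}(\mathbf{w}) = 0,$$

a contradiction. Thus we have suitable non-singular points for every prime $p$, and hence it
follows that $\sigma_p > 0$ for all primes $p$.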
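Euler's identity for a homogeneous form, on which the non-singularity argument rests, can be illustrated with a concrete norm form. The sketch below uses the norm form of the pure cubic field generated by the cube root of 2 as a stand-in; this example is chosen here for illustration and is not the form treated in the paper.

```python
def norm_form(x, y, z):
    """Norm of x + y*t + z*t^2, where t^3 = 2 (a homogeneous cubic form)."""
    return x ** 3 + 2 * y ** 3 + 4 * z ** 3 - 6 * x * y * z


def gradient(x, y, z):
    """Partial derivatives of norm_form, computed by hand."""
    return (3 * x ** 2 - 6 * y * z,
            6 * y ** 2 - 6 * x * z,
            12 * z ** 2 - 6 * x * y)


def euler_left_side(x, y, z):
    """Return w . grad N(w) for w = (x, y, z)."""
    gx, gy, gz = gradient(x, y, z)
    return x * gx + y * gy + z * gz


degree = 3
sample_points = [(1, 0, 0), (1, 1, 1), (2, -3, 5), (-4, 7, 1)]
print(all(euler_left_side(*w) == degree * norm_form(*w) for w in sample_points))
```

In particular a point where the gradient vanishes must satisfy $nN(\mathbf{w}) = 0$, which is the contradiction exploited above.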

We remark that, by combining (9.2), (9.5) and (9.7), we have 

with 

al := lim p-^^''^'^^^ M{p^ ,p^). 

Thus 6 is a standard product of local densities. 

It remains to establish Lemmas 20 and 21. To handle Lemma 20 we observe that the
variety defined by $F(\mathbf{x}) = 0$ takes the simple form

$$X_1 \cdots X_n + X_{n+1} \cdots X_{2n} + X_{2n+1} \cdots X_{3n} = 0$$

over $\overline{\mathbb{F}}_p$. This makes it clear that we have a hypersurface of projective dimension $3n - 2$, whose
singular locus has projective dimension $3n - 7$. Here we use the fact that the singular locus
consists of points where two or more coordinates vanish from each of the sets $\{X_1, \ldots, X_n\}$,
$\{X_{n+1}, \ldots, X_{2n}\}$, and $\{X_{2n+1}, \ldots, X_{3n}\}$. According to the result of Hooley [19] the number
of projective points modulo $p$ differs from $(p^{3n-1} - 1)/(p - 1)$ by an amount $O(p^{3n-4})$, whence
the number of points in $\mathbb{A}^{3n}(\mathbb{F}_p)$ is $p^{3n-1} + O(p^{3n-3})$. It follows that

$$M(p, 1) = p^{3n-1} + O(p^{3n-3}) - \#\{\mathbf{x} \pmod{p} : p \mid F(\mathbf{x}),\ p \mid N(\mathbf{x})\}.$$

In the set on the right we have 

p I Tij^/QiSNK/M^K/d^l)) - 2N^/q(s) 

and 

P I Nk/l(p), 

from which it follows that p | 2Nx/q{s). It follows, by the argument leading to (8.18) that 
the number of possible p (modp) is 0^(p"~^+^) and that the number of possible s (mod;j) is 

0^{p''-^+^). We then see that 

#{x(modp) : p \ F(x), p \ N(x)} <^ p3n-3+2^ 

for any fixed ^ > 0, which gives us the required bound (9.6). 

Turning to the proof of Lemma 21 we will begin by supposing that

$$\alpha \ge \max\{2\mu - 1, 2\}. \qquad (9.9)$$

It follows that $\alpha > \max\{\mu, 1\}$, whence (9.7) yields

$$f_0(p^{\alpha}, p^{\mu}) = \frac{p^{-(3n-1)\alpha}}{1 - R(p)p^{-n}} \left\{ M(p^{\alpha}, p^{\mu}) - p^{3n-1} M(p^{\alpha-1}, p^{\mu}) \right\}. \qquad (9.10)$$

It will be appropriate to observe at this point that $1 - R(p)/p^n \gg 1$, which follows from (9.1).
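We proceed to compare $M(p^{e}, p^{\mu})$ with $M(p^{e+1}, p^{\mu})$, using Hensel lifting. For $t < e$ we
define

$$S_t(p^e, p^{\mu}) := \{\mathbf{x} \pmod{p^e} : \mathbf{x} \equiv \mathbf{x}_0 \pmod{p^{\mu}},\ p^e \mid F(\mathbf{x}),\ p \nmid N(\mathbf{x}),\ p^t \,\|\, \nabla F(\mathbf{x})\}.$$

When $t < e/2$ and $t \le e - \max\{\mu, 1\}$ one sees that if $\mathbf{x} \in S_t(p^e, p^{\mu})$ then $\mathbf{x} + p^{e-t}\mathbf{y} \in S_t(p^e, p^{\mu})$
for all $\mathbf{y} \pmod{p^t}$. Thus $S_t(p^e, p^{\mu})$ is composed of cosets modulo $p^{e-t}$. Moreover $\mathbf{x} + p^{e-t}\mathbf{y}$ will
be in $S_t(p^{e+1}, p^{\mu})$ for exactly $p^{3n-1}$ choices of $\mathbf{y} \pmod{p}$. It follows that each coset modulo
$p^{e-t}$ in $S_t(p^e, p^{\mu})$ lifts to exactly $p^{3n-1}$ cosets modulo $p^{e+1-t}$ in $S_t(p^{e+1}, p^{\mu})$, and hence that

$$\# S_t(p^{e+1}, p^{\mu}) = p^{3n-1}\, \# S_t(p^e, p^{\mu})$$

for $t < e/2$ and $t \le e - \max\{\mu, 1\}$.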
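The simplest case $t = 0$ of this lifting statement says that a solution whose gradient is non-zero modulo $p$ has a fixed number of lifts at each step. The following brute-force sketch checks this for a toy equation in two variables, where the lifting factor is $p^{2-1} = p$ in place of $p^{3n-1}$; the polynomial $x^2 + y^2 - 1$ is a hypothetical stand-in and not the form $F$ of the text.

```python
def count_nonsingular(p, e):
    """Count (x, y) mod p^e with x^2 + y^2 = 1 and gradient non-zero mod p."""
    modulus = p ** e
    count = 0
    for x in range(modulus):
        for y in range(modulus):
            on_curve = (x * x + y * y - 1) % modulus == 0
            gradient_nonzero = (2 * x) % p != 0 or (2 * y) % p != 0
            if on_curve and gradient_nonzero:
                count += 1
    return count


for p in (3, 5):
    first = count_nonsingular(p, 1)
    second = count_nonsingular(p, 2)
    print(p, second == p * first)
```

The cases $t > 0$ treated in the text require the stronger hypothesis $t < e/2$, which this toy check does not exercise.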
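We now write

$$T(p^t) = \#\{\mathbf{x} \pmod{p^t} : p^t \mid \nabla F(\mathbf{x})\},$$

whence

$$\#\{\mathbf{x} \pmod{p^e} : p^t \mid \nabla F(\mathbf{x})\} = p^{3n(e-t)}\, T(p^t)$$

for $t \le e$. Then for any non-negative integer $\tau \le e$ we have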

( X = xq (modp^ 

M{p',pf')= #stip',pn+#< 

0^t<T 

whence 



(9.11) 



x(modp^) : p'^ \ -F(x), V , 

ptN(x),p-|VF(x) ^ 



Q^t<T 



^ #{x (modp"") : |VF(x)} = p 



3ra(e— t) 



Similarly we have 



^ p3n(e+l-r) j.(^r^ 



We therefore wish to use (9.11) for every value $t < \tau$. This will require that $\tau - 1 < e/2$ and $\tau - 1 \le e - \max\{\mu, 1\}$. This condition is equivalent to requiring that $\tau - 1 \le (e-1)/2$ and $\tau - 1 \le e - \max\{\mu, 1\}$, or that $\tau \le (e+1)/2$ and $\tau \le e - \max\{\mu - 1, 0\}$. Thus if

$$\tau = \tau(e) = \min\left\{e - \max\{\mu - 1, 0\},\ \left[\frac{e+1}{2}\right]\right\}$$

then (9.11) yields

$$|M(p^{e+1}, p^{\mu}) - p^{3n-1}M(p^{e}, p^{\mu})| \le 2p^{3n(e+1-\tau)}T(p^{\tau}).$$
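The scaling identity $\#\{\mathbf{x} \pmod{p^{e}} : p^{t} \mid \nabla F(\mathbf{x})\} = p^{3n(e-t)}T(p^{t})$ used above holds because the condition depends only on $\mathbf{x}$ modulo $p^{t}$. The following minimal sketch checks the analogous identity by brute force for a toy polynomial in $d = 2$ variables; the polynomial `x^2*y + y^3`, the prime 3 and the function names are illustrative assumptions, not objects from the paper.

```python
from itertools import product

def grad_F(x, y):
    # Gradient of the toy polynomial F(x, y) = x^2*y + y^3 (an illustrative choice,
    # not the polynomial F of the paper).
    return (2 * x * y, x * x + 3 * y * y)

def count_grad_zero(p, e, t):
    """Count (x, y) modulo p^e whose gradient is divisible by p^t."""
    mod_e, mod_t = p ** e, p ** t
    return sum(
        1
        for x, y in product(range(mod_e), repeat=2)
        if all(g % mod_t == 0 for g in grad_F(x, y))
    )

def T(p, t):
    """Toy analogue of T(p^t): gradient zeros counted modulo p^t itself."""
    return count_grad_zero(p, t, t)

p, d = 3, 2  # d plays the role of 3n in the text
for e in range(1, 4):
    for t in range(0, e + 1):
        # Each residue class modulo p^t has p^(d*(e-t)) lifts modulo p^e.
        assert count_grad_zero(p, e, t) == p ** (d * (e - t)) * T(p, t)
```

The check is exhaustive only for the small moduli shown; it illustrates the counting principle rather than proving it.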

We proceed to insert this bound into (9.10). If $e = \alpha - 1$ we find that $\tau(e) = [\alpha/2]$ provided that $\alpha \ge \max\{2\mu - 1, 1\}$. In fact we have made the stronger assumption (9.9), and we deduce that

$$f_0(p^{\alpha}, p^{\mu}) \ll p^{\alpha - 3n[\alpha/2]}\,T(p^{[\alpha/2]}). \tag{9.12}$$

We have therefore reduced our problem to one of providing a suitable upper bound for $T(p^{t})$.
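The claim that $\tau(\alpha - 1) = [\alpha/2]$ whenever $\alpha \ge \max\{2\mu - 1, 1\}$ is elementary, and can be confirmed numerically. The sketch below assumes that $\mu$ is a non-negative integer and uses a hypothetical helper `tau`; it also confirms that every $t < \tau$ meets the two conditions needed for (9.11).

```python
def tau(e, mu):
    """tau(e) = min{ e - max(mu - 1, 0), floor((e + 1) / 2) } as defined in the text."""
    return min(e - max(mu - 1, 0), (e + 1) // 2)

for mu in range(0, 8):  # assumption: mu is a non-negative integer
    for alpha in range(max(2 * mu - 1, 1), 40):
        e = alpha - 1
        # With e = alpha - 1 the minimum is attained by the integer part of alpha / 2.
        assert tau(e, mu) == alpha // 2
        # Every t < tau satisfies t < e/2 and t <= e - max(mu, 1).
        for t in range(tau(e, mu)):
            assert 2 * t < e and t <= e - max(mu, 1)
```

Below the threshold the identity can fail: for example `tau(3, 3)` equals 1, whereas $[\alpha/2] = 2$ for $\alpha = 4$.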
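We now recall that

$$F(\mathbf{x}) = 2^{-n}\Big(\mathrm{Tr}_{L/\mathbb{Q}}\big(\delta N_{K/L}(\mathbf{p})N_{K/L}(\mathbf{q})\big) - 2N_{K/\mathbb{Q}}(\mathbf{s})\Big).$$

If we set

$$G(\mathbf{p}, \mathbf{q}) = \mathrm{Tr}_{L/\mathbb{Q}}\big(\delta N_{K/L}(\mathbf{p})N_{K/L}(\mathbf{q})\big)$$

it then follows that

$$T(p^{t}) \le T_1(p^{t})\,T_2(p^{t}), \tag{9.13}$$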

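where

$$T_1(p^{t}) := \#\{(\mathbf{p}, \mathbf{q}) \pmod{p^{t}} : \nabla_{\mathbf{p}}G(\mathbf{p}, \mathbf{q}) \equiv \nabla_{\mathbf{q}}G(\mathbf{p}, \mathbf{q}) \equiv \mathbf{0} \pmod{p^{t}}\}$$

and

$$T_2(p^{t}) := \#\{\mathbf{s} \pmod{p^{t}} : \nabla N_{K/\mathbb{Q}}(\mathbf{s}) \equiv \mathbf{0} \pmod{p^{t}}\}.$$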

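We begin by explaining our estimation of $T_2(p^{t})$. The treatment of $T_1(p^{t})$ will then be in the same spirit, but a little more complicated. Over $\overline{\mathbb{Q}}$ we have

$$N_{K/\mathbb{Q}}(\mathbf{s}) = \prod_{i=1}^{n} L_i(\mathbf{s})$$

for certain linearly independent linear forms

$$L_i(\mathbf{s}) = \sum_{j=1}^{n} c_{ij}s_j$$

with coefficients in the normal closure $N$, say, of $K$. Indeed our original choice of the basis $\omega_1, \ldots, \omega_n$ ensures that the $c_{ij}$ are algebraic integers. We now have

$$\frac{\partial}{\partial s_j}N_{K/\mathbb{Q}}(\mathbf{s}) = \sum_{i=1}^{n} c_{ij}\mu_i,$$

where

$$\mu_i := L_1(\mathbf{s}) \cdots L_{i-1}(\mathbf{s})L_{i+1}(\mathbf{s}) \cdots L_n(\mathbf{s}).$$

Let $C$ denote the matrix $(c_{ij})_{i,j \le n}$ and write $D = \det C$, so that $D^{2}$ is the discriminant of $K$. Suppose now that $p^{t}$ divides $\nabla N_{K/\mathbb{Q}}(\mathbf{s})$. Then if $\boldsymbol{\mu}$ is the column vector with elements $\mu_i$ we will have $C\boldsymbol{\mu} \equiv \mathbf{0} \pmod{p^{t}}$. This divisibility relation may be interpreted in the ring of integers for $N$. We now pre-multiply by the matrix $C^{\mathrm{adj}}$, whose entries are algebraic integers, and use the fact that $C^{\mathrm{adj}}C = (\det C)I$ to deduce that $D\boldsymbol{\mu} \equiv \mathbf{0} \pmod{p^{t}}$. We conclude that $p^{t} \mid D\mu_i$ for each index $i$, where divisibility is again within the ring of integers of $N$. Then $p^{tn} \mid D^{n}\prod_{i=1}^{n}\mu_i = D^{n}N_{K/\mathbb{Q}}(\mathbf{s})^{n-1}$, and hence $p^{\lceil tn/(n-1) \rceil} \mid D^{2}N_{K/\mathbb{Q}}(\mathbf{s})$ whenever

$$p^{t} \mid \nabla N_{K/\mathbb{Q}}(\mathbf{s}).$$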
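The adjugate step above can be illustrated with ordinary integers standing in for the algebraic integers of the text. The sketch below is only an illustration under that simplifying assumption: the matrix `C`, the prime 3 and the exponent 2 are hypothetical choices. It verifies $C^{\mathrm{adj}}C = (\det C)I$ and checks that every solution of $C\boldsymbol{\mu} \equiv \mathbf{0} \pmod{p^{t}}$ also satisfies $D\boldsymbol{\mu} \equiv \mathbf{0} \pmod{p^{t}}$.

```python
from itertools import product

def minor(m, i, j):
    """Matrix m with row i and column j removed."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]

def det(m):
    """Determinant by Laplace expansion along the first row (fine for small matrices)."""
    if not m:
        return 1  # determinant of the empty matrix
    return sum((-1) ** j * m[0][j] * det(minor(m, 0, j)) for j in range(len(m)))

def adjugate(m):
    """Transpose of the cofactor matrix, so that adjugate(m) * m = det(m) * I."""
    n = len(m)
    return [[(-1) ** (i + j) * det(minor(m, j, i)) for j in range(n)] for i in range(n)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def matvec(a, v):
    return [sum(row[k] * v[k] for k in range(len(v))) for row in a]

# Hypothetical integer matrix with determinant 3, used with the modulus 3^2.
C = [[1, 1, 0], [0, 3, 1], [0, 0, 1]]
D = det(C)
n = len(C)
assert matmul(adjugate(C), C) == [[D if i == j else 0 for j in range(n)] for i in range(n)]

p, t = 3, 2
q = p ** t
solutions = [mu for mu in product(range(q), repeat=n)
             if all(x % q == 0 for x in matvec(C, mu))]
# Every solution of C*mu = 0 (mod p^t) satisfies D*mu = 0 (mod p^t).
assert all((D * x) % q == 0 for mu in solutions for x in mu)
```

In this example the solutions are not all zero modulo 9, so the final assertion is a non-trivial check of the implication.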
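Let

$$\sigma = \max\{\lceil tn/(n-1) \rceil - v_p(D^{2}),\ 0\}.$$

Since $t \le \sigma \le 2t$ we then have

$$T_2(p^{t}) = p^{n(t-\sigma)}\,\#\{\mathbf{s} \pmod{p^{\sigma}} : p^{t} \mid \nabla N_{K/\mathbb{Q}}(\mathbf{s})\} \le p^{n(t-\sigma)}\,\#\{\mathbf{s} \in \mathbb{Z}^{n} \cap (0, p^{\sigma}]^{n} : p^{\sigma} \mid N_{K/\mathbb{Q}}(\mathbf{s})\}.$$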

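Suppose that $(p)$ splits over $K$ as

$$(p) = \mathfrak{p}_1^{e_1} \cdots \mathfrak{p}_r^{e_r}, \quad \text{with} \quad N_{K/\mathbb{Q}}(\mathfrak{p}_i) = p^{f_i}$$

for $1 \le i \le r$. Let $\alpha = \sum_{j=1}^{n} s_j\omega_j$, so that $\alpha$ is a non-zero element of $O_K$. If we now set $v_i = v_{\mathfrak{p}_i}(\alpha)$ we deduce that

$$p^{\sigma} \,\Big|\, \prod_{i=1}^{r} \big(N_{K/\mathbb{Q}}(\mathfrak{p}_i)\big)^{v_i},$$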






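whence $\sigma \le \sum_{i} f_iv_i$. It then follows that $\sigma \le \sum_{i} f_ig_i$, where $g_i = \min\{v_i, \sigma\}$. Thus for each element $\mathbf{s}$ there are non-negative integers $g_i \le \sigma$ for which $\sum_{i} f_ig_i \ge \sigma$ and

$$\alpha \in \prod_{i=1}^{r} \mathfrak{p}_i^{g_i}.$$

This condition restricts $\mathbf{s}$ to a lattice $\Lambda$ say, with $p^{\sigma}\mathbb{Z}^{n} \subseteq \Lambda \subseteq \mathbb{Z}^{n}$, and with $p^{\sigma} \mid \det(\Lambda)$. The number of choices for $g_1, \ldots, g_r$ is at most $(\sigma + 1)^{r} \le (\sigma + 1)^{n}$. It therefore follows that

$$T_2(p^{t}) \ll (2t + 1)^{n}p^{nt - \lceil tn/(n-1) \rceil}, \tag{9.14}$$

since $D$ is fixed.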

We now examine Ti(p*) in a similar way. The form G(p, q) takes the shape 

n/2 n/2 n n/2 

j=l i=l i=n/2+l i=n/2+l 

If we replace SLi{p) by i^i(p) and (^'^-^^n/2-i-«(p) by -^^n/2-i-i(p) we find that 



^g(p,q) 
dpj 



where 



and 



i=l 



n/2 

Hi = Li(p) • • • Li_i(p)Lj+i(p) • • • Ln/2{p)'[lLi{q) 

i=l 



for 1 ^ z ^ n/2 and 



W = ^n/2+l (P) • • • Ln/2+i-lip)Ln/2+i+l {p) ' ' ' -^n (p) (q) 

«=n/2+l 

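for $n/2 < i \le n$. Thus $p^{t} \mid D\mu_i$ for $1 \le i \le n$, where $D$ is the determinant of the matrix $C = (c_{ij})_{i,j \le n}$. In particular $D$ is a non-zero algebraic integer. Taking the product for $i \le n$ yields $p^{tn} \mid D^{n}N_{K/\mathbb{Q}}(\mathbf{p})^{n/2-1}N_{K/\mathbb{Q}}(\mathbf{q})^{n/2}$. By symmetry we also obtain $p^{tn} \mid D^{n}N_{K/\mathbb{Q}}(\mathbf{q})^{n/2-1}N_{K/\mathbb{Q}}(\mathbf{p})^{n/2}$, and hence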



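$$p^{2tn} \mid D^{2n}\big(N_{K/\mathbb{Q}}(\mathbf{p})N_{K/\mathbb{Q}}(\mathbf{q})\big)^{n-1}.$$

It follows that $p^{\sigma} \mid N_{K/\mathbb{Q}}(\mathbf{p})N_{K/\mathbb{Q}}(\mathbf{q})$ with

$$\sigma = \max\{\lceil 2tn/(n-1) \rceil - v_p(D^{4}),\ 0\}.$$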

We can now complete the argument as before. This time there will be two sets of non-negative exponents $g_i$ and $g_i'$, say, corresponding to $\mathbf{p}$ and $\mathbf{q}$ respectively, and such that $\sum_{i} f_i(g_i + g_i') \ge \sigma$. Thus the factor $(\sigma + 1)^{n}$ must be replaced by $(\sigma + 1)^{2n}$. For each such set of exponents we find that $(\mathbf{p}, \mathbf{q})$ is restricted to a sublattice of $\mathbb{Z}^{2n}$ of index at least $p^{\sigma}$, and we deduce as before that

$$T_1(p^{t}) \ll (4t + 1)^{2n}p^{2nt - \lceil 2tn/(n-1) \rceil}.$$

In view of (9.13) and (9.14) we deduce that

$$T(p^{t}) \ll (4t + 1)^{3n}p^{3nt - \lceil tn/(n-1) \rceil - \lceil 2tn/(n-1) \rceil} \le (4t + 1)^{3n}p^{3nt - 3t - 2},$$

since if $m \in \mathbb{Z}$ we have $\lceil \theta \rceil \ge m + 1$ for any real number $\theta > m$. It now follows from (9.12) that

$$f_0(p^{\alpha}, p^{\mu}) \ll (2\alpha + 1)^{3n}p^{\alpha - 3[\alpha/2] - 2},$$

and this suffices for Lemma 21 when (9.9) holds. Finally, it is clear in fact that we still have this estimate if $2 \le \alpha \le 2\mu - 1$, since if $\mu > 0$ then $p$ belongs to the finite set of divisors of $M$. This therefore completes the proof of Lemma 21.
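The passage from the exponent $3nt - \lceil tn/(n-1) \rceil - \lceil 2tn/(n-1) \rceil$ to $3nt - 3t - 2$ rests on the inequalities $\lceil tn/(n-1) \rceil \ge t + 1$ and $\lceil 2tn/(n-1) \rceil \ge 2t + 1$, valid for $t \ge 1$ and $n \ge 2$. A small numerical check, with hypothetical helper names and under those assumptions on $t$ and $n$, might look as follows.

```python
def ceil_div(a, b):
    """Ceiling of a / b for integers a and b with b > 0, using exact integer arithmetic."""
    return -(-a // b)

def exponent_exact(n, t):
    """Exponent of p in the product of the bounds for T_1(p^t) and T_2(p^t)."""
    return 3 * n * t - ceil_div(t * n, n - 1) - ceil_div(2 * t * n, n - 1)

def exponent_bound(n, t):
    """Simplified exponent 3nt - 3t - 2 quoted in the text."""
    return 3 * n * t - 3 * t - 2

for n in range(2, 30):
    for t in range(1, 60):  # t = 0 is excluded: the ceilings then vanish
        assert ceil_div(t * n, n - 1) >= t + 1
        assert ceil_div(2 * t * n, n - 1) >= 2 * t + 1
        assert exponent_exact(n, t) <= exponent_bound(n, t)

# Taking t = [alpha/2] and multiplying by p^(alpha - 3n[alpha/2]) as in (9.12)
# leaves the exponent alpha - 3[alpha/2] - 2.
for n in range(2, 10):
    for alpha in range(2, 40):
        t = alpha // 2
        assert alpha - 3 * n * t + exponent_bound(n, t) == alpha - 3 * t - 2
```

The restriction $t \ge 1$ is harmless here, since $t = [\alpha/2]$ with $\alpha \ge 2$.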

Our final task in this section is to establish a lower bound for a^o to complement the upper 
bound in (8.15). For any c G let B{c;p) C M? denote the box centered on c with side 
length 2p. The final part of Lemma 9 implies that there exists p 3> G~^U"'^'^ such that 

for every {xi,X2) G B{6Nk/i{Uu^^^); p). Here we view 5N;^/£^(J7uW) as a vector (ci,C2) in 
M?. In this way we deduce from (8.6) and (8.14) that 

aoo»G2-" J2 Yl "^(^5^)' (9.15) 
we5rnz" vernz" 

where if Oj = ai(v) and 6 = 6(w) are given by (8.2) then 

J(v; w) := meas |x G M : ( a2(v)x, — ai(v)x + -^^^-r- J G B{c; p)\ . 

[ \ «2(v)y J 

The minimum distance from the line $\{(a_2x, -a_1x + b/a_2)\}$ to the point $(c_1, c_2)$ is equal to $|a_1c_1 + a_2c_2 - b|/\sqrt{a_1^{2} + a_2^{2}}$. Suppose now that we have points $\mathbf{v}$ and $\mathbf{w}$ satisfying

I V - y v(^) I < AG- V and |w - VTw^ | < XQ-^W (9.16) 

for some A ^ 1. If we set 

af ) = TV^/q(N;,/^(vW)), af ) = TV^/Q(rN^/^(vW)) 

and 

6W = 2N^/q(wW), 

then 

ai(v) = y"/2af ) + 0{XG-W/^), a2(v) = ^"/^^f ^ + ©(AG" V"/^) 

and 

6(w) = "W^"6W + 0(AG-^I^"). 

Moreover 

W2af ) + c2F"/2af ) - I^"6W = F(FvW; T^w^; t/uW) = 0. 

Thus \aici + a2C2 — b\ = OiXO'^W^), while af + 03 > V"' since a^^'^'' , 03^^^ are not both zero. 
Since p » G-^J/^/^, it follows that 

|aici + a2C2 -b\ p 



provided that we take $\lambda$ as a sufficiently small positive constant.
We now see that, for points $(\mathbf{v}, \mathbf{w})$ satisfying (9.16), the line $\{(a_2x, -a_1x + b/a_2)\}$ meets the disc $B(\mathbf{c}; \rho)$ centred at $\mathbf{c} = (c_1, c_2)$ in a segment of length $\gg \rho$, so that

J(v;w) » , „ » » 

^ ^ max{|ai|, |a2|} F^/^ 

The number of available points (v,w) is » G~^'"{VW)"' and we therefore conclude from 
(9.15) that 

(Too > 

This should be compared with the upper bound (8.15). 
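The point-to-line distance formula used in the argument above can be checked directly. Every point $(a_2x, -a_1x + b/a_2)$ lies on the line $a_1X + a_2Y = b$, so the distance to $(c_1, c_2)$ is $|a_1c_1 + a_2c_2 - b|/\sqrt{a_1^{2} + a_2^{2}}$. The sketch below compares this with the distance to the closest point of the parametrised line; the numerical values and function names are illustrative assumptions, and $a_2 \ne 0$ is required.

```python
import math

def line_point(a1, a2, b, x):
    """Point of the parametrised line {(a2*x, -a1*x + b/a2)}; requires a2 != 0."""
    return (a2 * x, -a1 * x + b / a2)

def distance_formula(a1, a2, b, c1, c2):
    """Closed-form distance from (c1, c2) to the line a1*X + a2*Y = b."""
    return abs(a1 * c1 + a2 * c2 - b) / math.hypot(a1, a2)

def distance_by_projection(a1, a2, b, c1, c2):
    """Distance obtained by minimising the squared distance over the parameter x."""
    # Setting the derivative of (a2*x - c1)^2 + (-a1*x + b/a2 - c2)^2 to zero gives:
    x = (a2 * c1 + a1 * (b / a2 - c2)) / (a1 ** 2 + a2 ** 2)
    px, py = line_point(a1, a2, b, x)
    return math.hypot(px - c1, py - c2)

a1, a2, b, c1, c2 = 3.0, 4.0, 10.0, 1.0, 2.0  # illustrative values only
# Points of the parametrisation satisfy a1*X + a2*Y = b.
for x in (-2.0, 0.0, 0.5, 7.0):
    X, Y = line_point(a1, a2, b, x)
    assert math.isclose(a1 * X + a2 * Y, b)
assert math.isclose(distance_formula(a1, a2, b, c1, c2),
                    distance_by_projection(a1, a2, b, c1, c2))
```

In the text this distance is shown to be small compared with $\rho$, which is what forces the line to cut the box in a long segment.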

Bringing together our lower bounds for 6 and in Lemma 19, we deduce that 

provided that //"/^ ^ V. This therefore leads to the following conclusion. 
Lemma 22. — Let G = log V. Assume that H"'/'^ ^ V and 

Then we have 

^{G,H,V) > (log"l/)i-3'^ii'"y2n_ 

Recalling (3.18) and our choice G = logV^, it is now time to select parameters Q,H,V 
such that ^{G,H,V) = o{^{G,H,V)). We will choose Q = H^^-^^^^, with which choice 
Lemma 22 implies that ^{G,H,V) » {logV^-^'' H^'V^'' , if 

In line with Lemma 3 we let V run through large integers congruent to 1 modulo M. Next 
we choose Hq = 1 + M[F^/(^°" )], which is a positive integer congruent to 1 modulo M. But 

then H = Hq has order ) and so Q = //("'^i)/!^ satisfies the conditions of Lemma 16. 

This implies that the required estimate for $S(G, H, V)$ holds and so completes the proof of
Theorem 2.